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NANBV DIAGNOSTICS: POLYNUCLEOTIDES USEFUL 



FOR SCREENING FOR HEPATITIS C VIRUS 



10 



Technical Field 



The invention relates to materials and 



methodologies for managing the spread of non-A f non-B 
hepatitis virus (NANBV) infection. More specif ically, it 
15 relates to an etiologic agent of non-A, non-B hepatitis 
( NANBH ) / hepatitis C virus (HCV), and to polynucleotides 
and analogs thereof, which are useful in assays for the 
detection of HCV in biological samples. 
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25 

Background Art 

Non-A, Non-B hepatitis (NANBH) is a 
transmissible disease or family of diseases that are 
believed to be viral-induced , and that are distinguishable 

30 from other forms of viral-associated liver diseases, 
including that caused by the known hepatitis viruses, 
i.e., hepatitis A virus (HAV), hepatitis B virus (HBV), 
and delta hepatitis virus (HDV), as well as the hepatitis 
induced by cytomegalovirus (CMV) or Epstein-Barr virus 

35 (EBV). NANBH was first identified in transfused 

individuals. Transmission from man to chimpanzee and se- 
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rial passage in chimpanzees provided evidence that NANBH 
is due to a transmissible infectious agent or agents. 

Epidemiologic evidence is suggestive that there 
may be three types of NANBH: the water-borne epidemic 
5 type; the blood or needle associated type; and the 
sporadically occurring (community acquired) type. 
However, the number of agents which may be the causative 
of NANBH are unknown. 

There have been a number of candidate NANBV. 
10 See, for example the reviews by Prince (1983), Feinstone 
and Hoofnagle (1984), and Overby (1985, 1986 , 1987) and 
the article by Iwarson (1987). However, there is no proof 
that any of these candidates represent the etiological 
agent of NANBH. 

15 The demand for sensitive, specific methods for 

screening and identifying carriers of NANBV and NANBV 
contaminated blood or blood products is significant. 
Post-transfusion hepatitis (PTH) occurs in approximately 
10% of transfused patients, and NANBH accounts for up to 
20 90% of these cases. The major problem in this disease is 
the frequent progression to chronic liver damage (25-55%). 

Patient care as well as the prevention of 
transmission of NANBH by blood and blood products or by 
close personal contact require reliable screening, 
25 diagnostic and prognostic tools to detect nucleic acids, 
antigens and antibodies related to NANBV. 

Methods for detecting specific polynucleotides 
by hybridization assays are known in the art. See f for 
example, Matthews and Kricka (1988), Analytical Bio- 
30 chemistry 169 :1; Landegren et al. (1988), Science 242 :229; 
and Mittlin (1989), Clinical chem. 35:1819. u.S. Patent 
No. 4,868,105, issued Sept. 9, 1989, and in E.P.O. 
Publication No. 225,807 (published June 16, 1987). 

Applicant discovered a new virus, the Hepatitis 
35 C virus (HCV), which has proven to be the major etiologic 
agent of blood-borne NANBH (BB-NANBH). Applicant's 
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initial work, including a partial genomic sequence of the 
prototype HCV isolate, CDC/HCV1 (also called HCV1), is 
described in E.P.O. Publication No. 318,216 (published 31 
May 1989) and PCT Pub. No. WO 89/04669 (published 1 June 
5 1989). The disclosures of these patent applications, as 
well as any corresponding national patent applications, 
are incorporated herein by reference. These applications 
teach, inter alia, recombinant DNA methods of cloning HCV 
sequences, HCV probe diagnostic techniques, anti-HCV anti- 
10 bodies, and methods of isolating new HCV sequences. 

Disclosure of the Invention 

The present invention is based on HCV sequences 
described in E.P.O. Publication No. 318,216 and in PCT 

15 Pub. No. WO 89/04669, as well as other HCV sequences that 
are described herein. Methods for isolating and/or 
detecting specific polynucleotides by hybridization could 
not be used for screening for HCV until Applicants' 
discovery of HCV. Accordingly, one aspect of the inven- 

20 tion is an oligomer capable of hybridizing to an HCV 

sequence in an analyte polynucleotide strand, wherein the 
oligomer is comprised of an HCV targeting sequence com- 
plementary to at least 4 contiguous nucleotides of HCV 
cDNA shown in Fig. 18. 

25 Another aspect of the invention is a process for 

detecting an HCV sequence in an analyte strand suspected 
of containing an HCV polynucleotide, wherein the HCV 
polynucleotide comprises a selected target region, said 
process comprising: f 

30 (a) providing an oligomer capable of hybridizing 

to an HCV sequence in an analyte polynucleotide strand, 
wherein the oligomer is comprised of an HCV targeting 
sequence complementary to at least 4 contiguous 
nucleotides of HCV cDNA shown in Fig. 18 

35 (b) incubating the analyte strand with the 

oligomer of (a) which allow specific hybrid duplexes to 
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forra between the targeting sequence and the target 
sequence ; and 

(d) detecting hybrids formed between target 
region, if any f and the oligomer. 

5 Yet another aspect of the invention is a method 

for preparing blood free of HCV comprising: 

(a) providing analyte nucleic acids from a 
sample of blood suspected of containing an HCV target 
sequence ; 

10 (b) providing an oligomer capable of hybrid- 

izing to the HCV sequence in an analyte polynucleotide 
strand, if any, wherein the oligomer is comprised of an 
HCV targeting sequence complementary to a sequence of at 
least 8 nucleotides present in a conserved HCV nucleotide 

15 sequence in HCV RNA; 

(c) reacting (a) with (b) under conditions 
which allow the formation of a polynucleotide duplex 
between the targeting sequence and the target sequence, if 
any; 

20 (d) detecting a duplex formed in (c), if any; 

and 

(e) saving the blood from which complexes were 
not detected in (d) . 

25 Brief Description of the Drawings 

Fig, 1 shows the sequence of the HCV cDNA in 
clone 12 f, and the amino acids encoded therein. 

Fig, 2 shows the HCV cDNA sequence in clone k9- 
1, and the amino acids encoded therein. 

Fig. 3 shows the sequence of clone 15e, and the 
amino acids encoded therein. 

Fig. 4 shows the nucleotide sequence of HCV cDNA 
in clone 13i, the amino acids encoded therein, and the 
sequences which overlap with clone 12f . 

35 



30 
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Fig. 5 shows the nucleotide sequence of HCV cDNA 



in clone 26 j f the amino acids encoded therein, and the 
sequences which overlap clone 13i. 

Fig. 6 shows the nucleotide sequence of HCV cDNA 
5 in clone CA59a, the amino acids encoded therein, and the 
sequences which overlap with clones 26 j and K9-1. 

Fig. 7 shows the nucleotide sequence of HCV cDNA 
in clone CA84a, the amino acids encoded therein, and the 
sequences which overlap with clone CAS 9a. 
10 Fig. 8 shows the nucleotide sequence of HCV cDNA 

in clone CA156e, the amino acids encoded therein, and the 
sequences which overlap with CA84a. 

Fig. 9 shows the nucleotide sequence of HCV cDNA 
in clone CA167b, the amino acids encoded therein, and the 
15 sequences which overlap CA156e. 

Fig. 10 shows the nucleotide sequence of HCV 
cDNA in clone CA216a, the amino acids encoded therein, and 
the overlap with clone CA167b. 

Fig. 11 shows the nucleotide sequence of HCV 
20 cDNA in clone CA290a, the amino acids encoded therein, and 
the overlap with clone CA216a. 

Fig. 12 shows the nucleotide sequence of HCV 
cDNA in clone ag30a and the overlap with clone CA290a. 

Fig. 13 shows the nucleotide sequence of HCV 
25 cDNA in clone CA205a, and the overlap with the HCV cDNA 
sequence in clone CA290a. 

Fig. 14 shows the nucleotide sequence of HCV 
cDNA in clone 18g, and the overlap with the HCV cDNA 
sequence in clone ag30a. 
30 Fig- 15 shows the nucleotide sequence of HCV 

cDNA in clone 16jh, the amino acids encoded therein, and 
the overlap of nucleotides with the HCV cDNA sequence in 
clone 15e. 



35 cDNA in clone 6k, the amino acids encoded therein, and the 



Fig. 16 shows the nucleotide sequence of HCV 
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overlap of nucleotides with the HCV cDNA sequence in clone 
16jh. 

Fig. 17 shows the nucleotide sequence of HCV 
cDNA in clone pl31jh, the amino acids encoded therein f and 
5 the overlap of nucleotides with the HCV cDNA sequence in 
clone 6k. 

Fig. 18 shows the the compiled HCV cDNA sequence 
derived from the clones described herein and from the 
compiled HCV cDNA sequence presented in E.P.O. Publication 

10 No. 318,216. The clones from which the sequence was 

derived are 5'-clone32, bll4a, 18g, ag30a, CA205a, CA290a, 
CA216a f pil4a, CA167b, CA156e, CA84a f CA59a, K9-1 (also 
called k9-l),26j, 13i f 12f f 14i, llb f 7f, 7e, 8h, 33c, 
40b, 37b, 35, 36, 81, 32, 33b, 25c, 14c, 8f, 33f, 33g, 

15 39c, 35f, 19g, 26g, 15e, b5a, 16jh, 6k, and p!31jh. In 
the figure the three horizontal dashes above the sequence 
indicate the position of the putative initiator methionine 
codon. Also shown in the figure is the amino acid 
sequence of the putative polyprotein encoded in the HCV 

20 cDNA. Heterogeneities in cloned DNAs of HCV1 are 
indicated by the amino acids indicated above the 
putatively encoded sequence of the large ORF; the 
parentheses indicate that the heterogeneity was detected 
at or near to the 5'- or 3 r - end of the HCV cDNA in the 

25 clone. 

Fig. 19 shows the sequences of capture and label 
probes for the detection of HCV RNA in biological samples. 

Fig. 20 shows schematic alignment of a 
flaviviral polyprotein and a putative HCV polyprotein 

30 encoded in the major ORF of the HCV genome. Also 

indicated in the figure are the possible functions of the 
flaviviral polypeptides cleaved from the flaviral 
polyprotein. In addition, the relative placements of the 
HCV polypeptides, NANB 5 _ 1 ^ 1 and C100, with respect to the 

35 putative HCV polyprotein are indicated. 
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Fig. 22 shows the double-stranded nucleotide 
sequence of the HCV cDNA insert in clone 81 , and the puta- 
tive amino acid sequence of the polypeptide encoded 
therein . 

5 Fig. 23 shows the HCV cDNA sequence in clone 36, 

the segment which overlaps the NANBV cDNA of clone 81 , and 
the polypeptide sequence encoded within clone 36. 

Fig. 24 shows the HCV cDNA sequence in clone 
37b, the segment which overlaps clone 35, and the 
10 polypeptide encoded therein. 

Fig. 25 shows autoradiographs of the HCV cPCR 
assay on RNA derived from liver samples of chimpanzees 
with NANBH (Fig. 25A) and on Italian patients with NANBH 
(Fig. 25B). 

15 Fig. 26A and 26B are graphs showing the temporal 

relationship between the display of liver damage, the 
presence of HCV RNA, and the presence of anti-HCV antibod- 
ies for two chimpanzees with NANBH. 

Fig. 27 shows the nucleotide sequence of HCV 

20 cDNA in clone CA84a, the amino acids encoded therein, and 
the sequences which overlap with clone CA59a. 

Fig. 28 shows the HCV cDNA sequence in clone 
40b, the segment which overlaps clone 37b, and the 
polypeptide encoded therein ♦ 

25 Fig. 29 is an autoradiograph showing the labeled 

amplified products of approximately 300, 30, and 3 CID of 
HCV genomes . 

Fig. 32 shows the nucleotide sequence of HCV 
cDNA in clone 40a. 

30 Fig. 33 is an autoradiograph showing amplified 

products extended from primers derived from conserved 
regions of the HCV genome. 

Fig. 34 shows the HCV cDNA sequence in clone 35 , 
the segment which overlaps clone 36, and the polypeptide 

35 encoded therein. 
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Fig. 37 is a diagram showing the relationship of 
probes and primers derived from the 5' -region of HCV RNA, 
from which the HCV cDNAs in clones ag30a and k9-l are 
derived . 

5 Fig. 38 is an aut ©radiograph of amplified 

products extended from sets of primers derived from ag30a 
and k9-l. 

Fig. 39 shows the aligned nucleotide sequences 

of human isolates 23 and 27 and of HCV1. Homologous 

10 sequences are indicated by the symbol (*). Non homologous 

sequences are in small letters. 

Fig. 40 shows the aligned amino acid sequences 

of human isolates 23 and 27 and of HCV1. Homologous 

sequences are indicated by the symbol ( * ) . Non homologous 

15 sequences are in small letters. 

Fig. 41 shows a half-tone reproduction of an 

autoradiograph of a Northern blot of RNA isolated from the 

liver of a BB-NANBV infected chimpanzee, probed with BB- 

NANBV cDNA of clone 81. 

20 Fig. 43 shows a half-tone reproduction of an 

autoradiograph of nucleic acids extracted from NANBV 

particles captured from infected plasma with anti-NANB c , 

32 5-1- 
1# and probed with P-labeled NANBV cDNA from clone 81. 

Fig. 44 shows reproductions of autoradiographs 

25 of filters containing isolated NANBV nucleic acids, probed 
32 

with P-labeled plus and minus strand DNA probes derived 
from NANBV cDNA in clone 81. 

Fig. 46 shows the . nucleotide consensus sequence 
of human isolate 23, variant sequences are shown below the 
30 sequence line. The amino acids encoded in the consensus 
sequence are also shown. 

Fig. 47 shows the nucleotide consensus sequence 
of human isolate 27, variant sequences are shown below the 
sequence line. The amino acids encoded in the consensus 
35 sequence are also shown. 
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Fig. 48 is a graph showing the relationship of 
the EnvL and EnvR primers to the model flavivirus 
polyprotein and putative HCV polyprotein. 

Fig. 49 shows a comparison of the composite 
5 aligned nucleotide sequences of isolates Thorn, EC1, HCT 
#18 , and HCV1. 

Fig. 50 shows a comparison of the nucleotide 
sequences of EC10 and a composite of the HCV1 sequence; 
the EC10 sequence is on the line above the dots, and the 
10 HCV1 sequence is on the line below the dots. 

Fig. 51 shows a comparison of the amino acid 
sequences 117-308 (relative to HCV1) encoded in the "EnvL" 
regions of the consensus sequences of human isolates HCT 
#18, JH23, JH 27, Thorne, EC1, and of HCV1. 
15 Fig. 52 shows a comparison of the amino acid 

sequences 330-360 (relative to HCV1) encoded in the "EnvR" 
regions of the consensus sequences of human isolates HCT 
#18, JH23, JH 27, Thorne, EC1, and of HCV1. 

Fig. 53 shows the nucleotide sequences of 
20 individual primers in primer mixture 5 '-3. 

Modes for Carrying Out the Invention 

The term "hepatitis C virus" (HCV) has been 

reserved by workers in the field for an heretofore unknown 
25 etiologic agent of NANBH. The prototype isolate of HCV 

has been identified in U.S. S.N. 122,714 (See also E.P.O. 

Publication No. 318,216). The term HCV also includes new 

isolates of the same viral species. As an extension of 

this terminology, the disease caused by HCV, formerly 
30 called blood-borne NANB hepatitis (BB- NANBH) , is called 

hepatitis C. The terms NANBH and hepatitis C may be used 

interchangeably herein. 

HCV is a viral species of which pathogenic 

strains cause BB-NANBH . There may also be attenuated 
35 strains or defective interfering particles derived 

therefrom. As shown infra, the HCV genome is comprised of 
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RNA. It is known that RNA containing viruses have 
relatively high rates of spontaneous mutation, i.e., 
reportedly on the order of 10~ 3 to 10~ 4 per incorporated 
nucleotide (Fields & Knipe (1986)). Therefore, since 
5 heterogeneity and fluidity of genotype are inherent in RNA 
viruses, there are multiple strains/isolates, which may be 
virulent or avirulent, within the HCV species. The 
compositions and methods described herein, enable the 
propagation, identification, detection, and isolation of 

10 the various HCV strains or isolates. 

Several different strains/isolates of HCV have 
been identified. (See infra). One such strain or 
isolate, which is a prototype, is named CDC/HCV1 (also 
called HCV1) . Information from one strain or isolate, 

15 such as a partial genomic sequence, is sufficient to allow 
those skilled in the art using standard techniques to 
isolate new strains /isolates and to identify whether such 
new strains/isolates are HCV. For example, several dif- 
ferent strains /isolates are described infra. These 

20 strains, which were obtained from a number of human sera 
(and from different geographical areas), were isolated 
utilizing the information from the genomic sequence of 
HCV1. 

Using the techniques described in E.P.O. 

25 Publication No. 318,216 and infra, the genomic structure 
and the nucleotide sequence of HCV1 genomic RNA has been 
deduced. The genome appears to be single-stranded RNA 
containing ~10,000 nucleotides. The genome is positive- 
stranded, and possesses a continuous, translational open 

30 reading frame (ORF) that encodes a polyprotein of about 
3,000 amino acids. In the ORF f the structural protein(s) 
appear to be encoded in approximately the first quarter of 
the N-terminus region, with the majority of the 
polyprotein responsible for non-structural proteins. When 

35 compared with all known viral sequences, small but 

significant co-linear homologies are observed with the 
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non-structural proteins of the flavivirus family, and with 
the pestiviruses (which are now also considered to be part 
of the Flavirus family) . 

A schematic alignment of possible regions of a 
5 flaviviral polyprotein (using Yellow Fever Virus as an 
example), and of a putative polyprotein encoded in the 
major ORF of the HCV genome, is shown in Fig. 20. In the 
figure the possible domains of the HCV polyprotein are 
indicated. The flavivirus polyprotein contains, from the 
10 amino terminus to the carboxy terminus, the nucleocapsid 
protein (C), the matrix protein (M) , the envelope protein 
(E), and the non-structural proteins (NS) 1, 2 (a+b), 3, 4 
(a+b), and 5. Based upon the putative amino acids encoded 
in the nucleotide sequence of HCV1, a small domain at the 

15 extreme N-terminus of the HCV polyprotein appears similar 
both in size and high content of basic residues to the 
nucleocapsid protein (C) found at the N-terminus of 
flaviviral polyproteins . The non-structural proteins 
2,3,4, and 5 (NS2-5) of HCV and of yellow fever virus 

20 (YFV) appear to have counter parts of similar size and 
hydropathic ity, although there is divergence of the amino 
acid sequences. However, the region of HCV which would 
correspond to the regions of YFV polyprotein which 
contains the M, E, and NS1 protein not only differs in 

25 sequence, but also appears to be quite different both in 
size and hydropathic ity . Thus, while certain domains of 
the HCV genome may be referred to herein as, for example, 
NS1, or NS2, it should be borne in mind that these 
designations are speculative? there may be considerable 

30 differences between the HCV family and flaviviruses that 
have yet to be appreciated. 

Different strains, isolates or subtypes of HCV 
are expected to contain variations at the amino acid and 
nucleic acids compared with HCV1 . Many isolates are 

35 expected to show much (i.e., more than about 40%) homology 
in the total amino acid sequence compared with HCV1. 
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However, it may also be found that there are other less 
homologous HCV isolates. These would be defined as HCV 
according to various criteria such as, for example, an ORF 
of approximately 9,000 nucleotides to approximately 12,000 
5 nucleotides, encoding a polyprotein similar in size to 
that of HCV1, an encoded polyprotein of similar hydro- 
phobic and/or antigenic character to that of HCV1, and the 
presence of co-linear peptide sequences that are conserved 
with HCV1. In addition, it is believed that the genome 

10 would be a positive-stranded RNA. 

All HCV isolates encode at least one epitope 
which is immunologically identifiable (i.e., im- 
munologically cross-reactive) with an epitope encoded in 
the HCV cDNAs described herein. Preferably the epitope is 

15 contained in an amino acid sequence described herein and 
is unique to HCV when compared to previously known 
pathogens. The uniqueness of the epitppe may be 
determined by its immunological reactivity with anti-HCV 
antibodies and lack of immunological reactivity with anti- 

20 bodies to known pathogens. 

HCV strains and isolates are evolutionarily 
related. Therefore, it is expected that the overall 
homology of the genomes at the nucleotide level may be 
about 40% or greater, probably will be about 50% or 

25 greater, probably about 60% or greater, and even more 

probably about 80% or greater; and in addition that there 
will be corresponding contiguous sequences of at least 
about 13 nucleotides. It should be noted, as shown infra, 
that there are variable and hypervariable regions within 

30 the HCV genome; therefore, the homology in these regions 
is expected to be significantly less than that in the 
overall genome. The correspondence between the putative 
HCV strain genomic sequence and, for example, the CDC/HCV1 
cDNA sequence can be determined by techniques known in the 

35 art. For example, they can be determined by a direct 
comparison of the sequence information of the 
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polynucleotide from the putative HCV, and the HCV cDNA 
sequence(s) described herein. They also can be determined 
by hybridization of the polynucleotides under conditions 
which form stable duplexes between homologous regions ( for 
5 example, those which would be used prior to S 1 digestion), 
followed by digestion with single stranded specific 
nuclease (s), followed by size determination of the 
digested fragments. 

Because of the evolutionary relationship of the 

10 strains or isolates of HCV, putative HCV strains or 
isolates are identifiable by their homology at the 
polypeptide level. Generally, HCV strains or isolates are 
expected to be at least 40% homologous, more than about 
50% homologous, probably more than about 70% homologous, 

15 and even more probably more than about 80% homologous, and 
some may even be more than about 90% homologous at the 
polypeptide level. The techniques for determining amino 
acid sequence homology are known in the art. For example, 
the amino acid sequence may be determined directly and 

20 compared to the sequences provided herein. Alternatively 
the nucleotide sequence of the genomic material of the 
putative HCV may be determined (usually via a cDNA inter- 
mediate), the putative amino acid sequence encoded therein 
can be determined, and the corresponding regions compared. 

25 As used herein, a polynucleotide "derived from" 

a designated sequence refers to a polynucleotide sequence 
which is comprised of a sequence of approximately at least 
about 6 nucleotides, preferably at least about 8 
nucleotides, more preferably at least about 10-12 

30 nucleotides, and even more preferably at least about 15-20 
nucleotides corresponding to a region of the designated 
nucleotide sequence- "Corresponding" means homologous to 
or complementary to the designated sequence. Preferably, 
the sequence of the region from which the polynucleotide 

35 is derived is homologous to or complementary to a sequence 
which is unique to an HCV genome. More preferably, the 
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derived sequence is homologous or complementary to a 
sequence that is unique to all or to a majority of HCV 
isolates. Whether or not a sequence is unique to the HCV 
genome can be determined by techniques known to those of 
5 skill in the art. For example, the sequence can be 
compared to sequences in databanks, e.g., Genebank, to 
determine whether it is present in the uninfected host or 
other organisms. The sequence can also be compared to the 
known sequences of -other viral agents, including those 

10 which are known to induce hepatitis, e.g., HAV, HBV, and 
HDV, and to members of the Flaviviridae . The correspond- 
ence or non-correspondence of the derived sequence to 
other sequences can also be determined by hybridization 
under the appropriate stringency conditions. Hybridiza- 

15 tion techniques for determining the complementarity of 
nucleic acid sequences are known in the art, and are 
discussed infra. See also, for example, Maniatis et al. 
(1982). In addition, mismatches of duplex polynucleotides 
formed by hybridization can be determined by known 

20 techniques, including for example, digestion with a 
nuclease such as SI that specifically digests single- 
stranded areas in duplex polynucleotides. Regions from 
which typical DNA sequences may be "derived" include but 
are not limited to, for example, regions encoding specific 

25 epitopes, as well as non-transcribed and/or non-translated 
regions . 

The derived polynucleotide is not necessarily 
physically derived from the nucleotide sequence shown, but 
may be generated in any manner, including for example, 

30 chemical synthesis or DNA replication or reverse 

transcription or transcription. In addition, combinations 
of regions corresponding to that of the designated 
sequence may be modified in ways known in the art to be 
consistent with an intended use. 

35 The term "recombinant polynucleotide" as used 

herein intends a polynucleotide of genomic, cDNA, 
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semisynthetic, or synthetic origin which, by virtue of its 
origin or manipulation: (1) is not associated with all or 
a portion of a polynucleotide with which it is associated 
in nature, (2) is linked to a polynucleotide other than 
5 that to which it is linked in nature, or (3) does not oc- 
cur in nature. 



to a polymeric form of nucleotides of any length, either 
ribonucleotides or deoxyribonucleotides . This term refers 

10 only to the primary structure of the molecule. Thus, this 
term includes double- and single-stranded DNA and RNA. It 
also includes known types of modifications, for example , 
labels which are known in the art, methylation, "caps", 
substitution of one or more of the naturally occurring 

15 nucleotides with an analog, internucleotide modifications 
such as, for example, those with uncharged linkages (e.g., 
methyl phosphonates, phosphotriesters , phosphoamidates, 
carbamates, etc.) and with charged linkages (e.g., 
phosphorothioates , phosphorodithioates , etc . ) , those 

20 containing pendant moieties, such as, for example proteins 
(including for e.g., nucleases, toxins, antibodies, signal 
peptides, poly-L-lysine, etc.), those with intercalators 
(e.g., acridine, psoralen*, etc.), those containing 
chelators (e.g., metals, radioactive metals, boron, oxida- 

25 tive metals , etc.), those containing alkylators, those 
with modified linkages (e.g., alpha anomeric nucleic 
acids, etc.), as well as unmodified forms of the 
polynucleotide . 



30 acid contains the sequence that has sequence homology to 
that of mRNA. The "anti-sense strand" contains a sequence 
which is complementary to that of the 'sense strand". 



35 single-stranded and which encodes a viral polypeptide (s ) . 
Examples of positive stranded RNA viruses include 



The term "polynucleotide" as used herein refers 



As used herein, the "sense strand" of a nucleic 



As used herein, a "positive stranded genome" of 
a virus is one in which the genome, whether RNA or DNA, is 
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Togaviridae, Coronaviridae , Retrovir idae , Picornaviridae, 
and Caliciviridae. Included also, are the Flaviviridae, 
which were formerly classified as Togaviradae. See Fields 
& Knipe (1986) . 
5 The term "primer" as used herein refers to an 

oligomer which is capable of acting as a point of initia- 
tion of synthesis of a polynucleotide strand when placed 
under appropriate conditions. The primer will be 
completely or substantially complementary to a region of 
10 the polynucleotide strand to be copied. Thus, under 
conditions conducive to hybridization, the primer will 
anneal to the complementary region of the analyte strand. 
Upon addition of suitable reactants, (e.g., a polymerase, 
nucleotide triphosphates, and the like), the primer is 
15 extended by the polymerizing agent to form a copy of the 
analyte strand. The primer may be single-stranded, or 
alternatively may be partially or fully double-stranded. 

The terms "analyte polynucleotide" and "analyte 
strand" refer to a single- or double-stranded nucleic acid 
20 molecule which is suspected of containing a target 

sequence, and which may be present in a biological sample. 

As used herein, the term "oligomer" refers to 
primers and to probes. The term oligomer does not connote 
the size of the molecule. However, typically oligomers 
25 are no greater than 1000 nucleotides, more typically are 
no greater than 500 nucleotides, even more typically are 
no greater than 250 nucleotides; they may be no greater 
than 100 nucleotides, and may be no greater than 75 
nucleotides, and also may be no greater than 50 
30 nucleotides in length. 

As used herein, the term "probe" refers to a 
structure comprised of a polynucleotide which forms a 
hybrid structure with a target sequence, due to 
complementarity of at least one sequence in the probe with 
35 a sequence in the target region. The polynucleotide 

regions of probes may be composed of DNA, and/or RNA, and/ 
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or synthetic nucleotide analogs. Included within probes 
are "capture probes" and "label probes". Preferably the 
probe does not contain a sequence complementary to 
sequence(s) used to prime the polymerase chain reaction 
5 (PCR). 

As used herein, the term "target region" refers 
to a region of the nucleic acid which is to be amplified 
and/or detected. The term "target sequence" refers to a 
sequence with which a probe or primer will form a stable 
10 hybrid under desired conditions. 

The term "capture probe" as used herein refers 
to a polynucleotide comprised of a single-stranded 
polynucleotide coupled to a binding partner. The single- 
stranded polynucleotide is comprised of a targeting 

15 polynucleotide sequence, which is complementary to a 

target sequence in a target region to be detected in the 
analyte polynucleotide. This complementary region is of 
sufficient length and complementarity to the target 
sequence to afford a duplex of stability which is suf- 

20 ficient to immobilize the analyte polynucleotide to a 
solid surface (via the binding partners). The binding 
partner is specific for a second binding partner; the 
second binding partner can be bound to the surface of a 
solid support, or may be linked indirectly via other 

25 structures or binding partners to a solid support. 

The term "targeting polynucleotide sequence" as 
used herein, refers to a polynucleotide sequence which is 
comprised of nucleotides which are complementary to a 
target nucleotide sequence; the sequence is of sufficient 

30 length and complementarity with the target sequence to 
form a duplex which has sufficient stability for the 
purpose intended. 

The term "binding partner" as used herein refers 
to a molecule capable of binding a ligand molecule with 

35 high specificity, as for example an antigen and an anti- 
body specific therefor. In general, the specific binding 
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partners must bind with sufficient affinity to immobilize 
the analyte copy/complementary strand duplex (in the case 
of capture probes) under the isolation conditions. 
Specific binding partners are known in the art f and 
5 include , for example, biotin and avidin or streptavidin, 
IgG and protein A, the numerous known receptor-ligand 
couples , and complementary polynucleotide strands. In the 
case of complementary polynucleotide binding partners/ the 
partners are normally at least about 15 bases in length, 

10 and may be at least 40 bases in length; in addition, they 
have a content of Gs and Cs of at least about 40% and as 
much as about 60%. The polynucleotides may be composed of 
DNA, RNA f or synthetic nucleotide analogs. 

The term "coupled" as used herein refers to at- 

15 tachment by covalent bonds or by strong non-covalent 
interactions (e.g., hydrophobic interactions, hydrogen 
bonds, etc.). Covalent bonds may be, for example, ester, 
ether, phosphoester , amide, peptide, imide, carbon-sulfur 
bonds, carbon-phosphorus bonds, and the like. 

20 The term "support" refers to any solid or semi- 

solid surface to which a desired binding partner may be 
anchored. Suitable supports include glass, plastic, 
metal, polymer gels, and the like, and may take the form 
of beads, wells, dipstics, membranes, and the like. 

25 The term "label" as used herein refers to any 

atom or moiety which can be used to provide a detectable 
(preferably quantifiable) signal, and which can be at- 
tached to a polynucleotide or polypeptide. 

As used herein, the term "label probe" refers to 

30 an oligomer which is comprised of targeting polynucleotide 
sequence, which is complementary to a target sequence to 
be detected in the analyte polynucleotide. This com- 
plementary region is of sufficient length and 
complementarity to the target sequence to afford a duplex 

35 comprised of the "label probe" and the "target sequence" 
to be detected by the label. The oligomer is coupled to a 
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label either directly, or indirectly via a set of ligand 
molecules with high specificity for each other. Sets of 
ligand molecules with high specificity are described 
supra. , and also includes multimers. 



linear or branched polymers of the same repeating single- 
stranded polynucleotide unit or different single-stranded 
polynucleotide units. At least one of the units has a 
sequence, length, and composition that permits it to 
10 hybridize specifically to a first single- stranded 

nucleotide sequence of interest, typically an analyte or 
an oligomer (e.g., a label probe) bound to an analyte. In 
order to achieve such specificity and stability, this unit 
will normally be at least about 15 nucleotides in length, 
15 typically no more than about 50 nucleotides in length, and 
preferably about 30 nucleotides in length; moreover, the 
content of Gs and Cs will normally be at least about 40% f 
and at most about 60%. In addition to such unit(s), the 
multimer includes a multiplicity of units that are capable 
20 of hybridizing specifically and stably to a second single- 
stranded nucleotide of interest, typically a labeled 
polynucleotide or another multimer. These units are 
generally about the same size and composition as the 
multimers discussed above. When a multimer is designed to 
25 be hybridized to another multimer, the first and second 
oligonucleotide units are heterogeneous (different), and 
do not hybridize with each other under the conditions of 
the selected assay. Thus, multimers may be label probes, 
or may be ligands which couple the label to the probe. 
30 As used herein, the term "viral RNA " , which 

includes HCV RNA, refers to RNA from the viral genome, 
fragments thereof, transcripts thereof, and mutant 
sequences derived therefrom. 



35 a sample of tissue or fluid isolated from an individual, 
including but not limited to, for example, plasma, serum, 



5 



The term "multimer", as used herein, refers to 



As used herein, a "biological sample" refers to 
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spinal fluid , lymph fluid, the external sections of the 
skin, respiratory, intestinal, and genitourinary tracts, 
tears, saliva, milk, blood cells, tumors, organs, and also 
samples of in vitro cell culture constituents (including 
5 but not limited to conditioned medium resulting from the 
growth of cells in cell culture medium, putatively virally 
infected cells, recombinant cells, and cell components). 

Description of the Invention 

10 The practice of the present invention will 

employ, unless otherwise indicated, conventional 
techniques of chemistry, molecular biology, microbiology, 
recombinant DNA, and immunology, which are within the 
skill of the art. Such techniques are explained fully in 

15 the literature. See e.g., Maniatis, Fitsch & Sambrook, 

MOLECULAR CLONING; A LABORATORY MANUAL (1982); DNA CLON- 
ING, VOLUMES I AND II (D.N Glover ed. 1985); 
OLIGONUCLEOTIDE SYNTHESIS (M.J. Gait ed, 1984); NUCLEIC 
ACID HYBRIDIZATION (B.D. Hames & S.J. Higgins eds . 1984); 

20 the series, METHODS IN ENZYMOLOGY (Academic Press, Inc.), 
particularly Vol. 154 and Vol. 155 (Wu and Grossman, and 
Wu, eds., respectively). All patents, patent applica- 
tions, and publications mentioned herein, both supra and 
infra, are hereby incorporated herein by reference. 

25 The useful materials and processes of the 

present invention are made possible by the identification 
of HCV as the etiologic agent of BB-NANBV, and by the 
provision of a family of nucleotide sequences isolated 
from cDNA libraries which contain HCV cDNA sequences. 

30 These cDNA libraries were derived from nucleic acid 
sequences present in the plasma of an HCV-infected 
chimpanzee. The construction of one of these libraries, 
the "c" library (ATCC No. 40394), is described in E.P.O. 
Publication No. 3 18 f 2 16. 

35 Utilizing the above -described HCV cDNA 

sequences, as well as that described herein, oligomers can 
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be constructed which are useful as reagents for detecting 
viral polynucleotides in biological samples. For example, 
from the sequences it is possible to synthesize DNA 
oligomers of about 8-10 nucleotides, or larger, which are 
5 useful as hybridization probes to detect the presence of 
HCV RNA in, for example, donated blood, blood fractions, 
sera of subjects suspected of harboring the virus, or cell 
culture systems in which the virus is replicating. In 
addition, the novel oligomers described herein enable 
10 further characterization of the HCV genome. 

Polynucleotide probes and primers derived from these 
sequences may be used to amplify sequences present in cDNA 
libraries, and/or to screen cDNA libraries for additional 
overlapping cDNA sequences, which, in turn, may be used to 

15 obtain more overlapping sequences. As indicated infra, 
and in E.P.O. Publication No. 318 , 216, the genome of HCV 
appears to be RNA comprised primarily of a large open 
reading frame (ORF) which encodes a large polyprotein. 

In addition to the above, the information 

20 provided infra allows the identification of additional HCV 
strains or isolates. The isolation and characterization 
of the additional HCV strains or isolates may be ac- 
complished utilizing techniques known to those of skill in 
the art, for example, by isolating the nucleic acids from 

25 body components which contain viral particles and/or viral 
RNA, creating cDNA libraries using the oligomers described 
infra., for screening the libraries for clones containing 
HCV cDNA sequences described infra., and comparing the HCV 
cDNAs from the new isolates with the cDNAs described in 

30 E.P.O. Publication No. 318,216 and infra. Strains or 
isolates which fit within the parameters of HCV, as 
described in the Definitions section, supra. , are readily 
identifiable. Other methods for identifying HCV strains 
will be obvious to those of skill in the art, based upon 

35 the information provided herein. 
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Isolation of the HCV cDNA Sequences 

The oligomers of the invention contain regions 
which form hybrid duplex structures with targeted 
5 sequences in HCV polynucleotides- The HCV polynucleotide 
hybridizing regions of the oligomers may be ascertained 
from the HCV cDNA sequence(s) provided herein, and 
described in E.P.O. Publication No, 318 f 216. A composite 
of HCV cDNA from HCVl f a prototypic HCV, is shown in Fig. 

10 18. The composite sequence is based upon sequence 
information derived from a number of HCV cDNA clones, 
which were isolated from a number of HCV cDNA libraries, 
including the "c" library present in lambda gtll (ATCC No. 
40394), and from human serum. The HCV cDNA clones were 

15 isolated by methods described in E.P.O. Publication No. 
318,216. Briefly, the majority of clones which were 
isolated contained sequences from the HCV cDNA "c M library 
which was constructed using pooled serum from a chimpanzee 
with chronic HCV infection and containing a high titer of 

20 the virus, i.e., at least 10 6 chimp infectious doses/ml 
(CID/ml) . The pooled serum was used to isolate viral 
particles; nucleic acids isolated from these particles was 
used as the template in the construction of cDNA libraries 
to the viral genome. The initial clone, 5-1-1, was 

25 obtained by screening the M c M library with serum from 

infected individuals. After the isolation of the initial 
clone, the remainder of. the sequence was obtained by 
screening with synthetic polynucleotide probes r the 
sequences of which were derived from the 5 '-region and the 

30 3 '-region of the known HCV cDNA sequence(s). 

The description of the methods to retrieve the 
cDNA sequences is mostly of historical interest. The 
resultant sequences (and their complements) are provided 
herein, and the sequences , or any portion thereof , could 

35 be prepared using synthetic methods, or by a combination 
of synthetic methods with retrieval of partial sequences 
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Publication No, 318 f 2 16, 



Oligomer Probes and Primers 
5 Using as a basis the HCV genome (as illustrated 

in Fig. 18)/ and/or preferably conserved regions of the 
HCV genome, oligomers of approximately 8 nucleotides or 
more can be prepared which hybridize with the positive 
strand(s) of HCV RNA or its complement, as well as to HCV 

10 cDNAs . These oligomers can serve as probes for the detec- 
tion (including isolation and/or labeling) of 
polynucleotides which contain HCV nucleotide sequences, 
and/or as primers for the transcription and/or replication 
of targeted HCV sequences. The oligomers contain a 

15 targeting polynucleotide sequence, which is comprised of 
nucleotides which are complementary to a target HCV 
nucleotide sequence; the sequence is of sufficient length 
and complementarity with the HCV sequence to form a duplex 
which has sufficient stability for the purpose intended. 

20 For example, if the purpose is the isolation, via im- 
mobilization , of an analyte containing a target HCV 
sequence, the oligomers would contain a polynucleotide 
region which is of sufficient length and complementarity 
to the targeted HCV sequence to afford sufficient duplex 

25 stability to immobilize the analyte on a solid surface, 
via its binding to the oligomers, under the isolation 
conditions. For example, also, if the oligomers are to 
serve as primers for the transcription and/or replication 
of target HCV sequences in an analyte polynucleotide, the 

30 oligomers would contain a polynucleotide region of suf- 
ficient length and complementarity to the targeted HCV 
sequence to allow the polymerizing agent to continue 
replication from the primers which are in stable duplex 
form with the target sequence, under the polymerizing 

35 conditions. For example, also, if the oligomers are to be 
used as label probes, or are to bind to multimers, the 
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targeting polynucleotide region would be of sufficient 
length and complementarity to form stable hybrid duplex 
structures with the label probes and/or multimers to allow 
detection of the duplex. The oligomers may contain a 
5 minimum of about 4 contiguous nucleotides which are com- 
plementary to targeted HCV sequence; usually the oligomers 
will contain a minimum of about 8 continguous nucleotides 
which are complementary to the targeted HCV sequence , and 
preferably will contain a minimum of about 14 contiguous 
10 nucleotides which are complementary to the targeted HCV 
sequence . 

Suitable HCV nucleotide targeting sequences may 

be comprised of nucleotides which are complementary 

nucleotides selected from the following HCV cDNA 

15 nucleotides, which are shown in Fig, 18, (nn - nn 

3 x x y 

denotes from about nucleotide number x to about nucleotide 
number y) ) : 



20 
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The oligomer, however, need not consist only of 

10 the sequence which is complementary to the targeted HCV 
sequence. It may contain in addition, nucleotide 
sequences or other moieties which are suitable for the 
purposes for which the oligomers are used. For example, 
if the oligomers are used as primers for the amplification 

15 of HCV sequences via PCR, they may contain sequences 

which, when in duplex, form restriction enzyme sites which 
facilitate the cloning of the amplified sequences. For 
example, also, if the oligomers are to be used as "capture 
probes" in hybridization assays (described infra), they 

20 would contain in addition a binding partner which is 

coupled to the oligomer containing the nucleotide sequence 
which is complementary to the targeted HCV sequence. 
Other types of moieities or sequences which are useful of 
which the oligomers may be comprised or coupled to, are 

25 those which are known in the art to be suitable for a 

variety of purposes, including the labeling of nucleotide 
probes . 

The preparation of the oligomers is by means 
known in the art, including , for example , by methods which 
30 include excision, transcription, or chemical synthesis. 
The target sequences and/or regions of the genome which 
are selected to which the targeting polynucleotides of the 
oligomers are complementary depend upon the purpose. For 
example, if the goal is to screen for the presence of HCV 
*. 35 in biological samples (e.g. blood), the preferred 

oligomers would be used as probes and/or primers, and 
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would hybridize to conserved regions of the HCV genome. 
Some of the conserved regions of the HCV genome to which 
the oligomers may bind are described herein , for example, 
the regions which include nucleotide numbers from about 
5 the 5 -terminus to about 200 , or from about 4000 to about 
5000, or from about 8000 to about 9040 as shown in Fig. 
18, or preferably nucleotides -318 to 174, 4056 to 4448, 
and 4378 to 4902. Other regions of the genome which are 
conserved are readily ascertainable by comparison of the 

10 nucleotide sequences of various isolates of HCV f including 
the prototype HCV, HCV1. Methods for conducting 
comparisons between genotypes to determine conserved and 
nonconserved regions are known in the art, and examples of 
these methods are disclosed herein. 

15 In the basic nucleic acid hybridization assay, 

single-stranded analyte nucleic acid (either DNA or RNA) 
is hybridized to a nucleic acid probe, and resulting 
duplexes are detected. The probes for HCV polynucleotides 
(natural or derived) are a length which allows the detec- 

20 tion of unique viral sequences by hybridization. While 6- 
8 nucleotides may be a workable length, sequences of 10-12 
nucleotides are preferred, and about 20 nucleotides or 
more appears optimal. Preferably, these sequences will 
derive from regions which lack heterogeneity. These 

25 probes can be prepared using routine methods, including 

automated oligonucleotide synthetic methods- Among useful 
probes, for example, are those derived from the newly 
isolated clones disclosed herein, as well as the various 
oligomers useful in probing cDNA libraries, set forth 

30 below. A complement to any unique portion of the HCV 

genome will be satisfactory. For use as probes , complete 
complementarity is desirable, though it may be unnecessary 
as the length of the fragment is increased. 

For use of such probes as agents to detect the 

35 presence of HCV polynucleotides (for example in screening 
for contaminated blood) , the biological sample to be 
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analyzed, such as blood or serum, may be treated, if 
desired, to extract the nucleic acids contained therein. 
The resulting nucleic acid from the sample may be 
subjected to gel electrophoresis or other size separation 
5 techniques; alternatively, the nucleic acid sample may be 
dot blotted without size separation. In order to form 
hybrid duplexes with the targeting sequence of the probe, 
the targeted region of the analyte nucleic acid must be in 
single stranded form- Where the sequence is naturally 

10 present in single stranded form, denaturation will not be 
required. However, where the sequence is present in 
double stranded form, the sequence will be denatured. De- 
naturation can be carried out by various techniques known 
in the art. Subsequent to denaturation, the analyte 

15 nucleic acid and probe are incubated under conditions 
which promote stable hybrid formation of the target 
sequence in the probe with the putative targeted sequence 
in the analyte, and the resulting duplexes containing the 
probe(s) are detected. 

20 Detection of the resulting duplex, if any, is 

usually accomplished by the use of labeled probes; 
alternatively, the probe may be unlabeled, but may be 
detectable by specific binding with a ligand which is 
labeled, either directly or indirectly. Suitable labels, 

25 and methods for labeling probes and ligands are known in 
the art, and include, for example, radioactive labels 
which may be incorporated by known methods (e.g., nick 
translation or kinasing), biotin, fluorescent groups, 
chemiluminescent groups (e.g., dioxetanes, particularly 

30 triggered dioxetanes), enzymes, antibodies, and the like. 

The region of the probes which are used to bind 
to the analyte can be made completely complementary to the 
HCV genome. Therefore, usually high stringency conditions 
are desirable in order to prevent false positives. 

35 However, conditions of high stringency should only be used 
if the probes are complementary to regions of the viral 
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genome which lack heterogeneity. The stringency of 
hybridization is determined by a number of factors during 
hybridization and during the washing procedure , including 
temperature, ionic strength , length of time, and 
5 concentration of formamide. These factors are outlined 
in, for example, Maniatis, T. (1982). 

Variations of this basic scheme which are known 
in the art, including those which facilitate separation of 
the duplexes to be detected from extraneous materials and/ 

10 or which amplify the signal from the labeled moiety, may 
also be used. A number of these variations are reviewed 
in, for example: Matthews and Kricka (1988), Analytical 
Biochemistry 169 :1; Landegren et al. (1988), Science 
242:229; and Mittlin (1989), Clinical chem. 35:1819. 

15 These and the following publications describing assay 
formats are hereby incorporated by reference herein. 
Probes suitable for detecting HCV in these assays are 
comprised of sequences which hybridize with target HCV 
polynucleotide sequences to form duplexes with the analyte 

20 strand, wherein the duplexes are of sufficient stability 
for detection in the specified assay system. 

A suitable variation is, for example, one which 
is described in U.S. Patent No. 4,868,105, issued Sept. 9 r 
1989, and in E.P.O. Publication No. 225,807 (published 

25 June 16, 1987). These publications describe a solution 
phase nucleic acid hybridization assay in which the 
analyte nucleic acid is hybridized to a labeling probe set 
and to a capturing probe set. The probe-analyte complex 
is coupled by hybridization with a solid-supported capture 

30 probe that is complementary to the capture probe set. 

This permits the analyte nucleic acid to be removed from 
solution as a solid phase complex. Having the analyte in 
the form of a solid phase complex facilitates subsequent 
separation steps in the assay. The labeling probe set is 

35 complementary to a labeled probe that is bound through 
hybridization to the solid phase/analyte complex. 
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Generally/ it is expected that the HCV genome 

sequences will be present in serum of infected individuals 

2 3 

at relatively low levels, i.e., at approximately 10 -10 
chimp infectious doses (CID) per ml. This level may 
5 require that amplification techniques be used in 

hybridization assays. Such techniques are known in the 
art. For example, the Enzo Biochemical Corporation "Bio- 
Bridge" system uses terminal deoxynucleotide transferase 
to add unmodified 3 ' -poly-dT-tails to a DNA probe. The 
10 poly dT-tailed probe is hybridized to the target 

nucleotide sequence, and then to a biot in-modified poly-A. 
PCT Publication 84/03520 and EP Publication No. 124221 
describe a DNA hybridization assay in which: (1) analyte 
is annealed to a single-stranded DNA probe that is com- 
15 plementary to an enzyme-labeled oligonucleotide? and (2) 
the resulting tailed duplex is hybridized to an enzyme- 
labeled oligonucleotide. EPA 204510 describes a DNA 
hybridization assay in which analyte DNA is contacted with 
a probe that has a tail, such as a poly-dT tail, an ampli- 
20 fier strand that has a sequence that hybridizes to the 

tail of the probe, such as a poly-A sequence, and which is 
capable of binding a plurality of labeled strands. A type 
of hybridization assay which is described in E.P.O. 
Publication No. 317,077 (published May 24, 1989), which 
25 should detect sequences at the level of approximately 10 6 / 
ml, utilizes nucleic acid multimers which bind to single- 
stranded analyte nucleic acid, and which also bind to a 
multiplicity of single-stranded labeled oligonucleotides. 
A particularly desirable technique may involve amplifica- 
30 tion of the target HCV sequences in sera approximately 

10,000 fold (i.e., to approximately 10 6 sequences /ml ) , as 
part of the hybridization system. The amplification may 
be accomplished, for example, by the polymerase chain re- 
actions (PCR) technique described by Saiki et al . (1986), 
35 by Mullis, U.S. Patent No. 4,683,195, and by Mullis et al. 
U.S. Patent No. 4,683,202. Amplification may be prior to, 
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or preferably subsequent to purification of the HCV target 
sequence. For example, amplification may be utilized in 
conjunction with the assay methods described in U.S. Pat- 
ent No. 4 , 868, 105 , or if even further amplification is 
5 desired, in conjunction with the hybridization system 
described in E.P.O. Publication No. 317,077. 

Preferred methods for detecting HCV sequences in* 
an analyte polynucleotide strand are based upon the 
hybridization detection methods described in U.S. Patent 

10 No. 4,868,105 and in E.P.O. Publication No. 317,077. 

These methods are solution-phase sandwich hybridization 
assays which utilize both capture and label probes which 
hybridize to target sequences in an analyte nucleic acid. 
In the use of these assays to screen biological samples 

15 for HCV, the probes used would bind to conserved regions 
of the HCV genome. The capture and label probes may be 
interspersed in their binding to the target sequence. 
Alternatively, in a preferred mode the capture and label 
probes are in sets, and the probes of one set do not 

20 intersperse with the probes of another set. In the latter 
mode, preferably the set(s) of multiple capture probes 
hybridize to the most conserved regions of the genome, 
while the set(s) of multiple label probes may hybridize to 
regions which exhibit small amounts of divergence. For 

25 example, using the prototype HCV1 cDNA sequence shown in 
Fig. 18, probes could be used which hybridize to sequences 
in the region of nucleotides from about -318 to about 174, 
and/or nucleotides in the region of about 4378 to about 
4902, and/or nucleotides in the region of from about 4056 

30 to about 4448. The preferred probes would hybridize to 
sequences in the 5 '-region of the HCV genome , since, as 
shown infra., this region appears to be highly conserved. 
Thus, preferred probes may hybridize to, for example, 
nucleotides from about -318 to about 174 as shown in Fig. 

35 18. Probes could be used which hybridize to either the 
positive strand in conserved regions, and/or its comple- 
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ment, depending upon the purpose, for example, to detect 
viral genomic sequences, or to detect HCV cDNA sequences 
resulting from PCR amplification, or to detect replicative 
intermediates to the positive HCV RNA strand, 

5 

Detection of HCV RNA and Polynucleotides Derived Therefrom 
Using an HCV/cPCR Method 

A particularly useful method for detecting HCV 
RNA or polynucleotides derived from HCV RNA is the HCV/ 
10 cPCR method, which is a subject of the herein application, 
and which utilizes the polymerase chain reaction technique 
(PCR) which is described by Saiki et al. (1986), by Mullis 
in U.S. Pat. No. 4,683,195, and by Mullis et al. in U.S. 
Patent No. 4,683,202. The HCV/cPCR method utilizes prim- 

15 ers and probes derived from the information provided 
herein concerning the nature of the HCV genome. 

Generally, in the PCR technique, short 
oligonucleotide primers are prepared which match opposite 
ends of a desired sequence. The sequence between the 

20 primers need not be known. A sample of polynucleotide is 
extracted and denatured, preferably by heat, and hybrid- 
ized with oligonucleotide primers which are present in 
molar excess. Polymerization is catalyzed by a template- 
and primer-dependent polymerase in the presence of 

25 deoxynucleotide triphosphates or nucleotide analogs 
( dNTPs ) . This results in two "long products" which 
contain the respective primers at their 5 '-termini, 
covalently linked to the newly synthesized complements of 
the original strands. The replicated DNA is again de- 

30 natured, hybridized with oligonucleotide primers, returned 
to polymerizing conditions, and a second cycle of replica- 
tion is initiated. The second cycle provides the two 
original strands, the two long products from cycle 1, and 
two "short products" replicated from the long products. 

35 The short products contain sequences (sense or antisense) 
derived from the target sequence, flanked at the 5'- and 
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3 '-termini with primer sequences- On each additional 
cycle, the number of short products is replicated 
exponentially. Thus, this process causes the amplifica- 
tion of a specific target sequence. 
5 In the method, a sample is provided which is 

suspected of containing HCV RNA, or a fragment thereof. 
The sample is usually taken from an individual suspected 
of having NANBH; however, other sources of the sample are 
included, e.g., conditioned medium or cells from in vitro 
10 systems in which the virus has been replicated. The 
sample, however, must contain the target nucleic acid 
sequence(s) . 

The sample is then subjected to conditions which 
allow reverse transcription of HCV RNA into HCV cDNA. 

15 Conditions for reverse transcribing RNA are known to those 
of skill in the art, and are described in, for example, 
Maniatis et al. (1982), and in Methods in Enzymology. A 
preferred method of reverse transcription utilizes reverse 
transcriptase from a variety of sources, including re- 

20 combinant molecules, and isolated from, for example, a 
retrovirus, preferably from avian myeloblastosis virus 
(AMV), and suitable conditions for the transcription. The 
HCV cDNA product of reverse transcription is in a RNA:DNA 
hybrid, which results from the first round of reverse 

25 transcription; subsequently, DNA: DNA hybrids result from 
two or more rounds of transcription. 

The HCV cDNA resulting from reverse transcrip- 
tion is then subjected to PCR to amplify the target 
sequence. In order to accomplish this f the HCV cDNA is 

30 denatured, and the separated strands are hybridized with 
primers which flank the target sequence. 

Strand separation may be accomplished by any 
suitable denaturing method, including physical, chemical , 
or enzymatic means, which are known to those of skill in 

35 the art. A preferred method, which is physical, involves 
heating the nucleic acid until it is completely (>99%) 
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denatured. Typical heat denaturation involves 
temperatures ranging from about 80°C to about 105°C, for 
times ranging from about 1 to 10 minutes . 

After hybridization of the HCV cDNA with the 
5 primers , the target HCV sequences are replicated by a 

polymerizing means which utilizes a primer oligonucleotide 
to initiate the synthesis of the replicate chain. The 
primers are selected so that they are complementary to 
sequences of the HCV genome. Oligomeric primers which are 
10 complementary to regions of the sense and antisense 
strands of HCV cDNA can be designed from the HCV cDNA 
sequences from the composite cDNA sequence provided in 
Fig. 18. 

The primers are selected so that their relative 

15 positions along a duplex sequence are such that an exten- 
sion product synthesized from one primer, when it is 
separated from its template (complement), serves as a 
template for the extension of the other primer to yield a 
replicate chain of defined length. 

20 The primer is preferably single stranded for 

maximum efficiency in amplification, but may alternatively 
be double stranded. If double stranded, the primer is 
first treated to separate its strands before being used to 
prepare extension products. Preferably , the primer is an 

25 oligodeoxyribonucleotide. The primer must be sufficiently 
long to prime the synthesis of extension products in the 
presence of the agent for polymerization. The exact 
lengths of the primers will depend on many factors, 
including temperature and source of the primer and use of 

30 the method. For example, depending on the complexity of 
the target sequence f the oligonucleotide primer typically 
contains about 15-45 nucleotides,, although it may contain 
more or fewer nucleotides. Short primer molecules gener- 
ally require cooler temperatures to form sufficiently 

35 stable hybrid complexes with the template. 
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The primers used herein are selected to be 
"substantially" complementary to the different strands of 
each specific sequence to be amplified. Therefore , the 
primers need not reflect the exact sequence of the 
5 template, but must be sufficiently complementary to 

selectively hybridize with their respective strands. For 
example, a non-complementary nucleotide fragment may be 
attached to the 5 f -end of the primer , with the remainder 
of the primer sequence being complementary to the strand. 

10 Alternatively, non-complementary bases or longer sequences 
can be interspersed into the primer, provided that the 
primer has sufficient complementarity with the sequence of 
one of the strands to be amplified to hybridize therewith, 
and to thereby form a duplex structure which can be 

15 extended by the polymerizing means. The non-complementary 
nucleotide sequences of the primers" may include restric- 
tion enzyme sites. Appending a restriction enzyme site to 
the end(s) of the target sequence would be particularly 
helpful for cloning of the target sequence. 

20 It will be understood that "primer", as used 

herein, may refer to more than one primer, particularly in 
the case where there is some ambiguity in the information 
regarding the terminal sequence(s) of the target region to 
be amplified. Hence, a "primer" includes a collection of 

25 primer oligonucleotides containing sequences representing 
the possible variations in the sequence or includes 
nucleotides which allow a typical basepairing. One of the 
primer oligonucleotides in this collection will be 
homologous with the end of the target sequence. A 

30 specific case is shown in the Examples, where oligomer 
sets of 44-mers and 45-mers were utilized to prime the 
amplification of a potentially variant region of the HCV 
genome . 

It is anticipated that there will be a variety 
35 of strains or isolates of HCV with sequences which deviate 
from HCV1, the prototype strain. Therefore, in order to 
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detect variant strains it is preferable to construct prim- 
ers which hybridize to conserved regions of the HCV 
genome . The conserved regions may be determined by 
comparing the nucleotide or amino acid sequences of 
5 several HCV strains/isolates. There appear to be at least 
three regions of conserved amino acid in the HCV genome, 
described supra • , from which primers may be derived . 
These regions are believed to be. The primers described 
infra . , in the Examples , are derived from what are 

10 believed to be conserved regions of HCV f based upon 
sequence homology to that of the Flaviviruses • 

The oligonucleotide primers may be prepared by 
any suitable method- Methods for preparing 
oligonucleotides of specific sequence are known in the 

15 art, and include, for example, cloning and restriction of 
appropriate sequences, and direct chemical synthesis. 
Chemical synthesis methods may include, for example, the 
phosphotriester method described by Narang et al. (1979), 
the phosphodiester method disclosed by Brown et al. 

20 (1979), the diethylphosphoramidate method disclosed in 
Beaucage et al. (1981), and the solid support method in 
U.S. Patent No. 4,458,066. 

The primers may be labeled, if desired, by in- 
corporating means detectable by spectroscopic, photo- 

25 chemical, biochemical, immunochemical, or chemical means. 
Template-dependent extension of the 
oligonucleotide primer (s) is catalyzed by a polymerizing 
agent in the presence of adequate amounts of the four 
deoxyribonucleotide triphosphates (dATP r dGTP, dCTP and 

30 dTTP) or analogs, in a reaction medium which is comprised 
of the appropriate salts, metal cations, and pH buffering 
system. Suitable polymerizing agents are enzymes known to 
catalyze primer- and template-dependent DNA synthesis. 
Known DNA polymerases include, for example, coli DNA 

35 polymerase I or its Klenow fragment, T 4 DNA polymerase, 
and Taq DNA polymerase. The reaction conditions for 
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catalyzing DNA synthesis with these DNA polymerases are 
known in the art. 

The products of the synthesis are duplex 
molecules consisting of the template strands and the 
5 primer extension strands , which include the target 

sequence- These products , in turn, serve as template for 
another round of replication. In the second round of 
replication, the primer extension strand of the first 
cycle is annealed with its complementary primer; synthesis 

10 yields a "short" product which is bounded on both the 5'- 
and the 3 '-ends by primer sequences or their complements. 
Repeated cycles of denaturation, primer annealing, and 
extension result in the exponential accumulation of the 
target region defined by the primers. Sufficient cycles 

15 are run to achieve the desired amount of polynucleotide 

containing the target region of nucleic acid. The desired 
amount may vary, and is determined by the function which 
the product polynucleotide is to serve. 

The PCR method can be performed in a number of 

20 temporal sequences. For example, it can be performed 

step-wise, where after each step new reagents are added, 
or in a fashion where all of the reagents are added 
simultaneously, or in a partial step-wise fashion, where 
fresh reagents are added after a given number of steps. 

25 In a preferred method, the PCR reaction is car- 

ried out as an automated process which utilizes a 
thermostable enzyme. In this process the reaction mixture 
is cycled through a denaturing region, a primer annealing 
region, and a reaction region. A machine may be employed 

30 which is specifically adapted for use with a thermostable 
enzyme, which utilizes temperature cycling without a 
liquid handling system, since the enzyme need not be added 
at every cycle. This type of machine is commercially 
available from Perkin Elmer Cetus Corp. 

35 After amplification by PCR f the target 

polynucleotides are detected by hybridization with a probe 
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polynucleotide which forms a stable hybrid with that of 
the target sequence under stringent to moderately 
stringent hybridization and wash conditions. If it is 
expected that the probes will be completely complementary 
5 (i.e., about 99% or greater) to the target sequence, 

stringent conditions will be used. If some mismatching is 
expected, for example if variant strains are expected with 
the result that the probe will not be completely com- 
plementary, the stringency of hybridization may be 

10 lessened. However, conditions are chosen which rule out 
nonspecific/adventitious binding. Conditions which affect 
hybridization, and which select against nonspecific bind- 
ing are known in the art, and are described in, for 
example, Maniatis et al. (1982). Generally, lower salt 

15 concentration and higher temperature increase the 
stringency of binding. For example, it is usually 
considered that stringent conditions are incubation in 
solutions which contain approximately 0-1 X SSC, 0.1% SDS, 
at about 65°C incubation/wash temperature, and moderately 

20 stringent conditions are incubation in solutions which 
contain approximately 1-2 X SSC, 0.1% SDS and about 50°- 
65°C incubation/wash temperature. Low stringency condi- 
tions are 2 X SSC and about 30°-50°C. 

Probes for HCV target sequences may be derived 

25 from the HCV cDNA sequence shown in Fig. 18, or from new 
HCV isolates. The HCV probes may be of any suitable 
length which span the target region, but which exclude the 
primers, and which allow specific hybridization to the 
target region. If there is to be complete 

30 complementarity, i.e., if the strain contains a sequence 
identical to that of the probe, since the duplex will be 
relatively stable under even stringent conditions, the 
probes may be short, i.e., in the range of about 10-30 
base pairs. If some degree of mismatch is expected with 

35 the probe, i.e., if it is suspected that the probe will 

hybridize to a variant region, the probe may be of greater 



WO 90/14436 PCT/US90/02853 

-50- 

length, since length seems to counterbalance some of the 
effect of the mismatch(es) . An example of this is found 
in the Examples, where the probe was designed to bind to 
potential variants of HCV1. In this case, the primers 
5 were designed to bind to HCV cDNA derived from a hypo- 
thetical conserved region of the HCV genome, and the 
target region was one which potentially contained varia- 
tions (based upon the Flavivirus model). The probe used 
to detect the HCV target sequences contained approximately 

10 268 base pairs. 

The probe nucleic acid having a sequence com- 
plementary to the target sequence may be synthesized using 
similar techniques described supra, for the synthesis of 
primer sequences. If desired, the probe may be labeled. 

15 Appropriate labels are described supra. 

In some cases, it may be desirable to determine 
the length of the PCR product detected by the probe. This 
may be particularly true if it is suspected that variant 
HCV strains may contain deletions within the target 

20 region, or if one wishes to confirm the length of the PCR 
product. In such cases it is preferable to subject the 
products to size analysis as well as hybridization with 
the probe. Methods for determining the size of nucleic 
acids are known in the art, and include, for example, gel 

25 electrophoresis, sedimentation in gradients, and gel 
exclusion chromatography. 

The presence of the target sequence in a bio- 
logical sample is detected, by determining whether a hybrid 
has been formed between the HCV polynucleotide probe and 

30 the nucleic acid subjected to the PCR amplification 

technique. Methods to detect hybrids formed between a 
probe and a nucleic acid sequence are known in the art. 
For example, for convenience, an unlabeled sample may be 
transferred to a solid matrix to which it binds, and the 

35 bound sample subjected to conditions which allow specific 
hybridization with a labeled probe; the solid matrix is 
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than examined for the presence of the labeled probe. 
Alternatively , if the sample is labeled, the unlabeled 
probe is bound to the matrix, and after the exposure to 
the appropriate hybridization conditions, the matrix is 
5 examined for the presence of label. Other suitable 
hybridization assays are described supra. 



Determination of Variant HCV Sequences Using PCR 

In order to identify variant HCV strains, and 
10 thereby to design probes for those variants, the above 
described HCV/cPCR method is utilized to amplify variant 
regions of the HCV genome, so that the nucleotide 
sequences of these variant target regions can be 
determined. Generally, variant types of HCV might be 
15 expected to occur in different geographic locations than 
that in which the HCV1 strain is predominant, for example, 
Japan, Africa, etc.; or in different vertebrate species 
which are also infected with the virus. Variant HCV may 
also arise during passage in tissue culture systems, or be 
20 the result of spontaneous or induced mutations. 

In order to amplify the variant target region, 
primers are designed to flank the suspect region, and 
preferably are complementary to conserved regions . Prim- 
ers to two regions of HCV which are probably conserved, 
25 based upon the Plavivirus model, are described in the 
Examples. These primers and probes may be designed 
utilizing the sequence information for the HCV1 strain 
provided in Fig. 18. 

Analysis of the nucleotide sequence of the 
30 target region(s) may be by direct analysis of the PCR 
amplified products. A process for direct sequence 
analysis of PCR amplified products is described in Saiki 
et al. (1988) . 

Alternatively, the amplified target sequence(s) 
35 may be cloned prior to sequence analysis. A method for 
the direct cloning and sequence analysis of enzymatically 
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amplified genomic segments has been described by Scharf 
(1986). In the method, the primers used in the PCR 
technique are modified near their 5 '-ends to produce 
convenient restriction sites for cloning directly into, 
5 for example, an M13 sequencing vector. After amplifica- 
tion, the PCR products are cleaved with the appropriate 
restriction enzymes. The restriction fragments are 
ligated into the Ml 3 vector, and transformed into, for 
example, a JM 103 host, plated out, and the resulting 
10 plaques are screened by hybridization with a labeled 
oligonucleotide probe. Other methods for cloning and 
sequence analysis are known in the art. 

Universal Primers for Flaviviruses and for HCV 

15 Studies of the nature of the genome of the HCV, 

utilizing probes derived from the HCV cDNA, as well as 
sequence information contained within the HCV cDNA, are 
suggestive that HCV is a Flavi-like virus. These studies 
are described in E.P.O. publication No. 318,216 owned by 

20 the herein assignee, and which is incorporated herein in 
its entirety. A comparison of the HCV cDNA sequence 
derived from the HCV cDNA clones with known sequences of a 
number of Flaviviruses show that HCV contains sequences 
which are homologous to conserved sequences in the 

25 Flaviviruses. These conserved sequences may allow the 
creation of primers which may be universal in their ap- 
plication for amplification of target regions of 
Flaviviruses, and for HCV.. These sequences are the 16-mer 
or smaller sequences from the 3 '-termini of the primers 

30 described in the Examples. Identification of the species 
is then accomplished utilizing a probe specific for the 
species. The genomes of a number of Flaviviruses are 
known in the art, and include , for example, Japanese 
Encephalitis Virus (Sumiyoshi et al. (1987)), Yellow Fever 

35 Virus (Rice et al. (1985)), Dengue Type 2 Virus (Hahn et 
al. (1988)), Dengue Type 4 Virus (Mackow (1987)), and West 
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Nile Virus (Castle et al. (1986)), Identification of HCV 
RNA is accomplished utilizing a probe specific for HCV, 
the sequence of which can be determined the HCV cDNA 
sequences provided herein. 
5 Alternatively, utilization of sets of probe(s) 

designed to account for codon degeneracy and therefore 
contain common sequences to the Flaviviruses and to HCV, 
as determined by a comparison of HCV amino acid sequences 
with the known sequences of the Flaviviruses, allows a 
10 general detection system for these viruses. 

Construction of Desired DNA Sequences 

Synthetic oligonucleotides may be prepared using 
an automated oligonucleotide synthesizer as described by 
15 Warner (1984). If desired the synthetic strands may be 

labeled with P by treatment with polynucleotide kinase 

32 

in the presence of P-ATP, using standard conditions for 
the reaction. 

DNA sequences, including those isolated from 

20 cDNA libraries, may be modified by known techniques, 
including, for example site directed mutagenesis, as 
described by Zoller (1982). Briefly, the DNA to be 
modified is packaged into phage as a single stranded 
sequence, and converted to a double stranded DNA with DNA 

25 polymerase using, as a primer, a synthetic oligonucleotide 
complementary to the portion of the DNA to be modified, 
and having the desired modification included in its own 
sequence. The resulting double stranded DNA is 
transformed into a phage supporting host bacterium. 

30 Cultures of the transformed bacteria, which contain 

replications of each strand of the phage, are plated in 
agar to obtain plaques. Theoretically , 50% of the new 
plaques contain phage having the mutated sequence, and the 
remaining 50% have the original sequence. Replicates of 

35 the plaques are hybridized to labeled synthetic probe at 
temperatures and conditions which permit hybridization 
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with the correct strand, but not with the unmodified 
sequence. The sequences which have been identified by 
hybridization are recovered and cloned. 

5 Kits for Screening for HCV Derived Polynucleotides 

Oligomers which are probes and/or primers for 
amplification and/or screening of samples for HCV can be 
packaged into kits. Kits for screening for HCV sequences 
include the oligomeric probe DNAs • Kits for amplification 

10 of HCV sequences may include the oligomeric primers used 
in the amplification. The kits usually contain the probes 
or primers in a premeasured or predetermined amount , as 
well as other suitably packaged reagents and materials , in 
separate suitable containers, needed for the particular 

15 hybridization and/or amplification protocol(s). For 

example, the kit may contain standards, buffers, supports, 
enzymes, substrates, label probes, binding partners, and/ 
or instructions for conducting the test. 

20 Examples 

Described below are examples of the present 
invention which are provided only for illustrative 
purposes, and not to limit the scope of the present inven- 
tion. 

25 

Isolation and Sequence of Overlapping 
HCV cDNA Clones 13i, 26i, CA59a, CA84a, CA156e and CA167b 

The clones 13i, 26 j, CA59a, CA84a, CA156e and 
CA167b were isolated from the lambda-gtll library which 

30 contains HCV cDNA (ATCC No. 40394), the preparation of 
which is described in E.P.O. Publication No. 318,216 
(published 31 May 1989) f and wo 89/04669 (published 1 June 
1989). Screening of the library was with the probes 
described infra., using the method described in Huynh 

35 (1985). The frequencies with which positive clones ap- 
peared with the respective probes was about 1 in 50,000. 
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The isolation of clone 13i was accomplished 
using a synthetic probe derived from the sequence of clone 
12 f . The sequence of the probe was: 

5 5 ' GAA CGT TGC GAT CTG GAA GAC AGG GAC AGG 3 ' . 

The isolation of clone 26 j was accomplished 
using a probe derived from the 5 '-region of clone K9-1. 
The sequence of the probe was: 

10 

5 ' TAT CAG TTA TGC CAA CGG AAG CGG CCC CGA 3 ' . 

The isolation procedures for clone 12 f and for 
clone k9-l (also called K9-1) are described in E.P.O. 

15 Publication No. 318,216 , and their sequences are shown in 
Figs. 1 and 2, respectively. The HCV cDNA sequences of 
clones 13i and 26 j, are shown in Figs. 4 and 5, 
respectively. Also shown are the amino acids encoded 
therein, as well as the overlap of clone 13i with clone 

20 12f, and the overlap of clone 26 j with clone 13i. The 

sequences for these clones confirmed the sequence of clone 
K9-1. Clone K9-1 had been isolated from a different HCV 
cDNA library (See E.P.O. Publication No. 218,316). 

Clone CA59a was isolated utilizing a probe based 

25 upon the sequence of the 5 '-region of clone 26 j. The 
sequence of this probe was: 

5 ' CTG GTT AGC AGG GCT TTT CTA TCA CCA CAA 3 ' . 

30 A probe derived from the sequence of clone CA59a 

was used to isolate clone CA84a. The sequence of the 
probe used for this isolation was: 

5 ' AAG GTC CTG GTA GTG CTG CTG CTA TTT GCC 3 ' . 
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Clone CA156e was isolated using a probe derived 
from the sequence of clone CA84a. The sequence of the 
probe was: 

5 5 ' ACT GGA CGA CGC AAG GTT GCA ATT GCT CTA 3 ' . 

Clone CA167b was isolated using a probe derived 
from the sequence of clone CA 156e. The sequence of the 
probe was: 



10 



5 ' TTC GAC GTC ACA TCG ATC TGC TTG TCG GGA 3 ' . 



The nucleotide sequences of the HCV cDNAs in 
clones CA59a, CA84a, CA156e, and CA167b, are shown Figs. 
15 6, 7, 8, and 9, respectively. The amino acids encoded 
therein, as well as the overlap with the sequences of 
relevant clones , are also shown in the figures . 

Creation of "pi" HCV cDNA Library 
20 A library of HCV cDNA, the "pi" library, was 

constructed from the same batch of infectious chimpanzee 
plasma used to construct the lambda-gtll HCV cDNA library 
(ATCC No. 40394) described in E.P.O. Publication No. 
318,216, and utilizing essentially the same techniques. 
25 However, construction of the pi library utilized a primer- 
extension method, in which the primer for reverse 
transcriptase was based on the sequence of clone CA59a. 
The sequence of the primer was: 

30 5' GGT GAC GTG GGT TTC 3' . 

Isolation and Sequence of Clone pi!4a 
Screening of the "pi" HCV cDNA library described 
supra., with the probe used to isolate clone CA167b (See 
35 supra.) yielded clone pil4a. The clone contains about 800 
base pairs of cDNA which overlaps clones CA167b, CA156e, 
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CA84a and CA59a, which were isolated from the lambda gt-11 
HCV cDNA library (ATCC No. 40394), in addition, pil4a also 
contains about 250 base pairs of DNA which are upstream of 
the HCV cDNA in clone CA167b. 

5 

Isolation and Sequence of Clones CA216a, CA290a and aq30a 

Based on the sequence of clone CA167b a 
synthetic probe was made having the following sequence: 

10 5' GGC TTT ACC ACG TCA CCA ATG ATT GCC CTA 3' 

The above probe was used to screen the , which yielded 
clone CA216a f whose HCV sequences are shown in Fig, 10. 

Another probe was made based on the sequence of 
15 clone CA216a having the following sequence: 

5' TTT GGG TAA GGT CAT CGA TAC CCT TAC GTG 3' 

Screening the lambda-gtll library (ATCC No- 40394) with 
20 this probe yielded clone CA290a, the HCV sequences therein 

being shown in Fig. 11. 

In a parallel approach, a primer-extension cDNA 

library was made using nucleic acid extracted from the 

same infectious plasma used in the original lambda-gtll 
25 cDNA library described above. The primer used was based 

on the sequence of clones CA216a and CA290a: 

5' GAA GCC GCA CGT AAG 3' 

30 The cDNA library was made using methods similar to those 
described previously for libraries used in the isolation 
of clones pil4a and k9-l. The probe used to screen this 
library was based on the sequence of clone CA290a: 



35 



5' CCG GCG TAG GTC GCG CAA TTT GGG TAA 3' 
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Clone ag30a was isolated from the new library with the 
above probe , and contained about 670 basepairs of HCV 
sequence. ' See Fig. 12. Part of this sequence overlaps 
the HCV sequence of clones CA216a and CA290a. About 300 
5 base-pairs of the ag30a sequence f however, is upstream of 
the sequence from clone CA290a. The non-overlapping 
sequence shows a start codon (*) and stop codons that may 
indicate the start of the HCV ORF. Also indicated in Fig. 
12 are putative small encoded peptides (#) which may play 
10 a role in regulating translation, as well as the putative 
first amino acid of the putative polypeptide (/) f and 
downstream amino acids encoded therein. 

Isolation and Sequence of Clone CA205a 
15 Clone CA205a was isolated from the original 

lambda gt-11 library (ATCC No. 40394), using a synthetic 
probe derived from the HCV sequence in clone CA290a (Fig. 
11). The sequence of the probe was: 

20 5 r TCA GAT CGT TGG TGG AGT TTA CTT GTT GCC 3' . 

The sequence of the HCV cDNA in CA205a f shown in Fig, 13, 
overlaps with the cDNA sequences in both clones ag30a and 
CA290a. The overlap of the sequence with that of CA290a 
25 is shown by the dotted line above the sequence (the figure 
also shows the putative amino acids encoded in this frag- 
ment) . 

As observed from the HCV cDNA sequences in 
clones CA205a and ag30a r the putative HCV polyprotein ap- 

30 pears to begin at the ATG start codon; the HCV sequences 
in both clones contain an in-frame f contiguous double stop 
codon (TGATAG) forty two nucleotides upstream from this 
ATG. The HCV ORF appears to begin after these stop 
codons, and to extend for at least .8907 nucleotides (See 

35 the composite HCV cDNA shown in Fig. 18). 
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Isolation and Sequence of Clone 18q 
Based on the sequence of clone ag30a (See Fig. 
12) and of an overlapping clone from the original lambda 
5 gt-11 library (ATCC No. 40394) , CA230a, a synthetic probe 
was made having the following sequence: 

5 ' CCA TAG TGG TCT GCG GAA CCG GTG AGT ACA 3 ' . 

10 Screening of the original lambda-gtll HCV cDNA library 

with the probe yielded clone 18g, the HCV cDNA sequence. of 
which is shown in Fig. 14. Also shown in the figure are 
the overlap with clone ag30a, and putative polypeptides 
encoded within the HCV cDNA. 

15 The cDNA in clone 18g (C18g or 18g) overlaps 

that in clones ag30a and CA205a, described supra. The 
sequence of C18g also contains the double stop codon 
region observed in clone ag30a. The polynucleotide region 
upstream of these stop codons presumably represents part 

20 of the 5 '-region of the HCV genome, which may contain 

short ORFs, and which can be confirmed by direct sequenc- 
ing of the purified HCV genome. These putative small 
encoded peptides may play a regulatory role in transla- 
tion. The region of the HCV genome upstream of that 

25 represented by C18g can be isolated for sequence analysis 
using essentially the technique described in E.P.O. 
Publication No. 318,216 for isolating cDNA sequences 
upstream of the HCV cDNA sequence in clone 12f . Es- 
sentially, small synthetic oligonucleotide primers of 

30 reverse transcriptase, which are based upon the sequence 
of C18g, are synthesized and used to bind to the cor- 
responding sequence in HCV genomic RNA. The primer 
sequences are proximal to the known 5 '-terminal of C18g f 
but sufficiently downstream to allow the design of probe 

35 sequences upstream of the primer sequences . Known 

standard methods of priming and cloning ar eused. The 
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resulting cDNA libraries are screened with sequences 
upstream of the priming sites (as deduced from the 
elucidated sequence of C18g) . The HCV genomic RNA is 
obtained from either plasma or liver samples from 
5 individuals with NANBH. Since HCV appears to be a Flavi- 
like virus f the 5 '-terminus of the genome may be modified 
with a "cap" structure. It is known that Flavi virus 
genomes contain 5 '-terminal "cap" structures. (Yellow 
Fever virus, Rice et al. (1988); Dengue virus, Hahn et al 
10 (1988); Japanese Encephalitis Virus (1987)). 

Isolation and Sequence of Clones from 
the beta-HCV cDNA library 
Clones containing cDNA representative of the 3'- 

15 terminal region of the HCV genome were isolated from a 
cDNA library constructed from the original infectious 
chimpanzee plasma pool which was used for the creation of 
the HCV cDNA lambda-gtll library (ATCC No. 40394), 
described in E.P.O. Publication No. 318 f 2 16. In order to 

20 create the DNA library, RNA extracted from the plasma was 
"tailed" with poly rA using poly (rA) polymerase, and cDNA 
was synthesized using oligo(dT) j^-is as a P r; * jner ^ or 
reverse transcriptase. The resulting RNArcDNA hybrid was 
digested with RNAase H, and converted to double stranded 

25 HCV cDNA. The resulting HCV cDNA was cloned into lambda- 
gtlO, using essentially the technique described in Huynh 
(1985), yielding the beta (or b) HCV cDNA library. The 
procedures used were as follows. 

An aliquot (12ml) of the plasma was treated with 

30 proteinase K, and extracted with an equal volume of phenol 
saturated with 0.05M Tris-Cl, pH 7.5, 0.05% (v/v) beta- 
mercaptoethanol, 0.1% (w/v) hydroxyquinolone, 1 mM EDTA. 
The resulting aqueous phase was re-extracted with the 
phenol mixture, followed by 3 extractions with a 1:1 

35 mixture containing phenol and chloroformrisoamyl alcohol 
(24:1), followed by 2 extractions with a mixture of 
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chloroform and isoamyl alcohol (1:1). Subsequent to 
adjustment of the aqueous phase to 200 mM with respect to 
NaCl, nucleic acids in the aqueous phase were precipitated 
overnight at -20°C f with 2.5 volumes of cold absolute 
5 ethanol. The precipitates were collected by centrifuga- 
tion at 10,000 RPM for 40 min., washed with 70% ethanol 
containing 20 mM NaCl, and with 100% cold ethanol, dried 
for 5 min. in a dessicator, and dissolved in water. 

The isolated nucleic acids from the infectious 

10 chimpanzee plasma pool were tailed with poly rA utilizing 
poly-A polymerase in the presence of human placenta 
ribonuclease inhibitor (HPRI) (purchased from Amersham 
Corp.), utilizing MS2 RNA as carrier. Isolated nucleic 
acids equivalent to that in 2 ml of plasma were incubated 

15 in a solution containing TMN (50 mM Tris HC1, pH 7.9, 10 

mM MgCl~, 250 mM NaCl, 2.5 mM MnCl 0 , 2 mM dithiothreitol 

* 32 * 

( DTT ) ) , 40 micromolar alpha-[ P] ATP, 20 units HPRI 

(Amersham Corp.), and about 9 to 10 units of RNase free 

poly-A polymerase (BRL). Incubation was for 10 min. at 

20 37°C, and the reactions were stopped with EDTA (final 
concentration about 250 mM) . The solution was extracted 
with an equal volume of phenol-chloroform, and with an 
equal volume of chloroform, and nucleic acids were 
precipitated overnight at -20°C with 2.5 volumes of 

25 ethanol in the presence of 200 mM NaCl. 

Isolation of Clone b5a 
The beta HCV cDNA library was screened by 
hybridization using a synthetic probe, which had a 
30 sequence based upon the HCV cDNA sequence in clone 15e. 

The isolation of clone 15e is described in E.P.O. Publica- 
tion No. 318/216, and its sequence is shown in Fig. 3. 
The sequence of the synthetic probe was: 

35 5' ATT GCG AGA TCT ACG GGG CCT GCT ACT CCA 3' . 
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Screening of the library yielded clone beta-5a (b5a), 
which contains an HCV cDNA region of approximately 1000 
base pairs. The 5 '-region of this cDNA overlaps clones 
35f , 19g, 26g, and 15e (these clones are described supra) . 
5 The region between the 3 '-terminal poly-A sequence and the 
3 '-sequence which overlaps clone 15e f contains ap- 
proximately 200 base pairs. This clone allows the 
identification of a region of the 3 '-terminal sequence the 
HCV genome. 

10 The sequence of b5a is contained within the 

sequence of the HCV cDNA in clone 16jh (described infra). 
Moreover, the sequence is also present in CC34a f isolated 
from the original lambda-gtll library (ATCC No. 40394). 
(The original lambda-gtll library is referred to herein as 

15 the "C" library). 

Isolation and Sequence of Clones Generated by PCR 
Amplification of the 3 '-Region of the HCV Genome 

Multiple cDNA clones have been generated which 

20 contain nucleotide sequences derived from the 3 '-region of 
the HCV genome. This was accomplished by amplifying a 
targeted region of the genome by a polymerase chain re- 
action technique described in Saiki et al. (1986), and in 
Saiki et al. (1988), which was modified as described 

25 below. The HCV RNA which was amplified was obtained from 
the original infectious chimpanzee plasma pool which was 
used for the creation of the HCV cDNA lambda-gtll library 
(ATCC No. 40394) described, in E.P.O. Publication No. 
318,216. Isolation of the HCV RNA was as described supra. 

30 The isolated RNA was tailed at the 3 '-end with ATP by E. 
coli poly-A polymerase as described in Sippel (1973), 
except that the nucleic acids isolated from chimp serum 
were substituted for the nucleic acid substrate. The 
tailed RNA was then reverse transcribed into cDNA by 

35 reverse transcriptase, using an oligo dT-primer adapter, 
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essentially as described by Han (1987), except that the 
components and sequence of the primer-adapter were: 

Stuf f er Not I SP6 Promoter Primer 

5 AATTC GCGGCCGC CATACGATTTAGGTGACACTATAGAA T 15 

The resultant cDNA was subjected to amplification by PCR 
using two primers: 



10 Primer Sequence 

JH32 (30mer) ATAGCGGCCGCCCTCGATTGCGAGATCTAC 
JH11 (20mer) AATTCGGGCGGCCGCCATACGA 



The JH32 primer contained 20 nucleotide sequences 

15 hybridizable to the 5 '-end of the target region in the 

cDNA, with an estimated T of 66°C. The JH11 was derived 
' m 

from a portion of the oligo dT-primer adapter; thus, it is 
specific to the 3 '-end of the cDNA with a T^ of 64°C. 
Both primers were designed to have a recognition site for 

20 the restriction enzyme, NotI, at the 5 '-end, for use in 
subsequent cloning of the amplified HCV cDNA. 

The PCR reaction was carried out by suspending 
the cDNA and the primers in 100 microliters of reaction 
mixture containing the four deoxynucleoside triphosphates, 

25 buffer salts and metal ions, and a thermostable DNA 
polymerase isolated from Thermus aquaticus (Taq 
polymerase), which are in a Perkin Elmer Cetus PCR kit 
(N801-0043 or N801-0055) . The PCR reaction was performed 
for 35 cycles in a Perkin Elmer Cetus DNA thermal cycler. 

30 Each cycle consisted of a 1.5 min denaturation step at 
94°C, an annealing step at 60°C for 2 min, and a primer 
extension step at 72°C for 3 min. The PCR products were 
subjected to Southern blot analysis using a 30 nucleotide 
probe, JH34, the sequence of which was based upon that of 

35 the 3 '-terminal region of clone 15e. The sequence of JH34 
is: 
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5 ' CTT GAT CTA CCT CCA ATC ATT CAA AGA CTC 3 ' . 



The PCR products detected by the HCV cDNA probe ranged in 
5 size from about 50 to about 400 base pairs. 

In order to clone the amplified HCV cDNA, the 
PCR products were cleaved with Not I and size selected by 
polyacrylamide gel electrophoresis. DNA larger than 300 
base pairs was cloned into the Not I site of pUC18S The 

10 vector pUC18S is constructed by including a NotI 

polylinker cloned between the EcoRI and Sail sites of 
pUC18. The clones were screened for HCV cDNA using the 
JH34 probe. A number of positive clones were obtained and 
sequenced. The nucleotide sequence of the HCV cDNA insert 

15 in one of these clones, 16 jh, and the amino acids encoded 
therein, are shown in Fig. 15. A nucleotide heterogene- 
ity, detected in the sequence of the HCV cDNA in clone 
16jh as compared to another clone of this region r is 
indicated in the figure. 



20 



25 



Isolation and Sequence of Clone 6k 
Based on the sequence of clone 16jh and clone 
b5a (see supra), a synthetic probe was made having the 
following sequence; 

5 ' TCT TCA ACT GGG CAG TAA GAA CAA AGC TCA 3 ' . 



Screening of the original lambda-gtll HCV cDNA library 
(described in E.P.O. Publication No. 318,216) with the 

30 probe yielded clones with a frequency of approximately 1 
in 10 6 ; one of these was called clone 6k (also called 
C6k), the HCV cDNA sequence of which is shown in Fig. 16. 
Also shown in the figure are the overlap with clone 16jh, 
and putative polypeptides encoded within the HCV cDNA. 

35 Sequence information on the HCV cDNA in clone 6k was 

obtained from only one strand. Information on the deposit 
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of this clone is provided infra, wherein the clone is 
listed as Lambda gtll C6k. Confirmation of the C6K 
sequence as part of an ORF encoding HCV1 polypeptide has 
been obtained by sequencing other overlapping clones. 

5 

Isolation and Sequence of Clone p!31jh 
A clone containing sequence from the 3 '-region 
of the HCV genome, and which contains an in- frame stop 
codon f was isolated essentially as described supra., for 
10 the isolation of clones generated by PGR amplification of 
the 3 'region of the genome, except that HCV1 RNA was 
converted to cDNA using the oligonucleotide 

5' AAT TCG CGG CCG CCA TAC GAT TTA GGT GAC 
15 ACT ATA GAA T 15 3' . 

The cDNA was then amplified by the PCR reaction using the 
primers r 

20 5' TTC GCG GCC GCT ACA GCG GGG GAG ACA T 3' 

and 

5 ' AAT TCG CGG CCG CCA TAC GA 3'. 

25 

After amplification, the PCR products were 
precipitated with spermine, digested with Not I, and 
extracted with phenol. The purified products were cloned 
into the NotI site of pUC18S, and HCV positive clones were 
30 selected using the oligonucleotide: 

5 ' CGA TGA AGG TTG GGG TAA ACA CTC CGG CCT 3 ' . 

The HCV cDNA in one clone, designated pl31jh f is shown in 
35 Fig. 17. This clone contains an in-frame stop codon for 
the large ORF contained in the HCV genome. 
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Isolation and Sequence of Clone 5'-clone32 

A clone containing sequence from the 5 '-region 
of the HCV genome, upstream of the sequence in clone 
5 b!14a f was isolated and the nucleotide sequence determined 
by a modification of the method for the isolation and 
sequence of clones generated by PCR amplification of the 
3'-region of the genome, described in U.S. S.N. 456,637, 
which is incorporated by reference. Generally, a target 

10 region of the genome was amplified by the PCR technique 
described in Saiki et al. (1986), and in Saiki et al 
(1988). The HCV RNA which was amplified was obtained by 
extracting human serum (U.S. clinical isolate, HCV27) 
using a cold guanidinium thiocyanate method described by 

15 Han et al. (1987). The extracted RNA was converted into 
single stranded cDNA with reverse transcriptase, using a 
primer, JH94, which is complementary to nucleotides -250 
to -223 of the HCV genome (see Fig. 18). The sequence of 
JH94 is: 

20 

5 ' CCT GCG GCC GCA CGA CAC TCA TAC TAA 3 ' . 

Conversion of single- to double-stranded HCV cDNA was ac- 
complished by tailing the DNA with approximately 20 to 50 
25 dA residues using terminal deoxynucleotidyl transferase 

(Sambrook et al. (1989), MOLECULAR CLONING), and replicat- 
ing the tailed molecule using the following oligo-dT 
primer-adapter, which contains a NotI site, and an sp6 
promoter : 

30 

Stuf fer NotI SP6 Promoter Primer 

AATTC GCGGCCGC CATACGATTTAGGTGACACTATAGAA T 15 

The resultant cDNA was subjected to amplification by PCR 
35 using two primers, JH94 (described supra.) and JH11, which 
has the following sequence. 
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Primer Sequence 

JH11 (20mer) AATTCGGGCGGCCGCCATACGA 

5 The PCR reaction was carried out by suspending 

the cDNA and the primers in 100 microliters of reaction 
mixture containing the four deoxynucleoside triphosphates, 
buffer salts and metal ions, and a thermostable DNA 
polymerase isolated from Thermus aquaticus (Taq 

10 polymerase), which are in a Perkin Elmer Cetus PCR kit 

(N801-0043 or N801-0055). The PCR reaction was performed 
for 35 cycles in a Perkin Elmer Cetus DNA thermal cycler . 
Each cycle consisted of a 1.5 min denaturation step at 
94°C, an annealing step at 60°C for 2 min, and a primer 

15 extension step at 72°C for 3 min. 

The PCR products were digested with NotI, and 
cloned into pUC18S. Clones containing HCV nucleotide 
sequences were obtained by screening with a probe, Alex90, 
which is derived from nucleotides -312 to -283 of the HCV1 

20 genome, and which has the sequence: 

5 ' ACC ATG AAT CAC TCC CCT GTG AGG AAC TAC 3 ' . 

The HCV cDNAs in the isolated clones were sequenced by the 
25 dideoxy chain termination method (Sanger et al . (1977)). 
The sequence of HCV cDNA in one of the isolated clones, 
5'-clone32, spans the region of nucleotides -224 to -341 
in Fig. 18. 

An analysis of the nucleotide sequence of the 
30 HCV cDNA showed that the replicate of the HCV RNA strand 
contains a GC-rich stretch which may be capable of forming 
a stable hairpin structure: 
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G-T 
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A C-C-C-C-C-G C-C-G' 5' 
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1 4 » 

I • f 



T-G-G-G-G-G-C-G-A-C- 3' 



In the structure, the dashed lines indicate possible 
hydrogen bonds between complementary nucleotides. 

10 A search in the computer database , Genebank, 

revealed that homologous sequences were absent from known 
viral sequences. Thus, this sequence may be unique to the 
5 '-terminus of the HCV genome. 

A hairpin structure may serve as a recognition 

15 signal for a transcriptase and/or it may contribute to the 
stability of the RNA at the 5 '-terminus. 



Compiled HCV cDNA Sequences 
An HCV cDNA sequence has been compiled from a 

20 series of overlapping clones derived from various HCV cDNA 
libraries described herein, and in E.P.O. Publication No. 
318,216. The clones from which Fig. 18 has been derived 
are clone 5'-32, bll4a, 18g, ag30a, CA205a, CA290a, 
CA216a, pil4a, CA167b, CA156e, CA84a, CA59a, K9-1 (also 

25 called k9-l), 26j, 13i, 12f, 14i, lib, 7f, 7e, 8h, 33c, 
40b, 37b, 35, 36, 81, 32, 33b, 25c, 14c, 8f, 33f, 33g, 
39c, 35f, 19g, 26g, 15e, b5a, 16jh, C6k and pl31jh. The 
methods for isolation of these clones, as well as their 
sequences, are discussed herein, and in E.P.O. Publication 

30 No. 318,216, which is incorporated herein by reference. 
In Fig. 18, the three dashes above the sequence indicate 
the position of the putative initiator methionine codon. 

Clone bl!4a overlaps with clones 18g, ag30a f and 
CA205a, except that clone bll4a contains an extra two 

35 nucleotides upstream of the sequence in clone 18g (i.e., 
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5'-CA). These extra two nucleotides have been included in 
the HCV genomic sequence shown in Fig. 18. 

It should be noted that although several of the 
clones described supra, have been obtained from libraries 
5 other than the original HCV cDNA lambda-gtll C library 
(ATCC No. 40394) , these clones contain HCV cDNA sequences 
which overlap HCV cDNA sequences in the original library. 
Thus, essentially all of the HCV sequence is derivable 
from the original lambda-gtll C library (ATCC No. 40394) 
10 which was used to isolate the first HCV cDNA clone (5-1- 
1). The isolation of clone 5-1-1 is described in E.P.O. 
Publication No. 318,216, which is incorporated herein by 
reference. 

The putative sequence of the major HCV 

15 polyprotein encoded in the composite of HCV1 cDNA is also 
shown. The first amino acid in the sequence is the puta- 
tive initiator methionine of the large ORF. The variant 
amino acids, due to the clonal heterogeneities, are 
indicated above the sequence. Since the lambda gtll 

20 library was created from serum obtained from one 

individual (see E.P.O. Publication No. 318,216), the 
results suggest that variant viral sequences (both 
nucleotide and amino acid) are present in that individual. 
An examination of the composite HCV cDNA 

25 sequence shows that besides the large ORF, there are a 
number of ORFs upstream of that encoding the polyprotein, 
and within the sequence encoding the polyprotein there are 
a large number of smaller ORFs in the other two 
translational frames. The ORFs upstream of the HCV 

30 polyprotein are shown in the Table immediately below. 



35 
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Table 

ORFs Upstream of that Encoding the Large 
HCV Polyprotein 

5 

Nucl . # Translation Frame Amino Acid Sequence 

-310 1 MNHSPVRNYCLHAESV 

-329 3 MGATLHHESLPCEELL 

SSRRKRLAMALV 

10 -246 2 MSWQPPGPPLPGEP 

-127 1 MPGDLGVPPQDC 

The reading frame , position , and size of the ORFs 
downstream of the sequence encoding the putative initiator 
15 MET of the polyprotein are shown in the Table below. The 
major polyprotein is that translated from reading frame 2. 

Table 

ORFs Downstream of the Putative Initiator MET 
20 Encoding Sequence 

Reading Frame Size(aa) Position (bp) 

1 168 696 

1 105 2343 
25 1 119 5616 

2 3025 -42 

3 160 5 
3 111 1667 
3 148 6893 

30 

In addition to the above, an examination of the 
sequence which is complementary to the genomic strand of 
HCV RNA also contains several small ORFs. One of these 
ORFs, which is complementary to nucleotides -341 to +837 
35 in the HCV RNA sequence, encodes a polypeptide of 385 
amino acids. 
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Comparison of the Sequences of 5 ' -Regions 
Obtained from HCV Isolates from Different 
Geographical Locations 
5 Nucleotide sequences from the 5'- regions of HCV 

isolates from the U.S.A. (HCV18, HCV27), from Italy 
(HCVI1, HCVI24), and from Korea (HCVK1) were compared. 

Isolation of the HCV cDNA sequences was es- 
sentially as described supra. f for the isolation of 5'- 
10 clone32 f except for the following. The extracted RNA was 
reverse-transcribed into cDNA using as primers either JH51 
or rl6, which are complementary to HCV nucleotides -90 to 
-73 and 366 to 383, respectively. The sequences of these 
primers are as follows. 

15 

Primer Seguence 
JH51 5' CCC AAC ACT ACT CGG CTA 3' 

r!6 5' CAC GTA AGG GTA TCG ATG 3' 

20 Amplification of the HCV dsDNA was by the PCR method using 
JH93 and JH52 as 5'- and 3'- primers, respectively. The 
HCV sequence in JH93 is derived from HCV nucleotides -317 
to -296, that in JH52 is from HCV nucleotides -93 to -117; 
the nucleotide numbers are indicated in parentheses below 

25 the sequences. In JH52 the underlined dinucleotide has 
been mutated to create the NotI site. The sequences of 
these primers are the following. 

(Primer) Stuf fer NotI HCV sequence 

30 (JH93) 5' TTC GCGGCCGC ACTCCATGAATCACTCCCC 3' 

(-317) (-296) 

(JH52) 5' AGTCTT GCGGCCGC ACGCCCAAATC 3' 

(-93) (-117) 

35 
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After amplification, the PCR products were cleaved by 
NotI f and cloned into pUC18S. The HCV cDNAs were 
sequenced either by direct sequencing after amplification 
by PCR, or alternatively, the cloned HCV cDNAs were 
5 sequenced by the primer extension and the dideoxy method. 
Primer extension and the dideoxy method of sequencing were 
performed as described supra,, for the sequence of 5'- 
clone32. 

The PCR method for direct sequencing used Alex90 
10 (see supra, for the sequence) as the 5 '-primer, and r25 as 
the 3 '-primer. Alex90 is derived from HCV nucleotides 
-312 to -283, and r25 is derived from nucleotides 365 to 
342 (See Fig. 18). The sequence of r25 is: 

15 5 ' ACC TTA CCC AAA TTG CGC GAC CTA 3 ' . 

A comparison of the sequences of the 5 '-region 
of HCV27, HCVK1, HCVI1, HCVI24, and HCV18 with the 
sequence of the prototype HCV, HCV1, showed the following. 

20 The examined 5'- region is highly conserved amongst the 5 
HCV isolates. The sequences appeared to be identical 
except for one nucleotide which was deleted at position - 
171 in HCVI24, and for the ambiguity in four nucleotides 
at positions -222 to -219 in isolate HCVK1. 

25 The high levels of sequence conservation in this 

region may reflect the role of this region in viral 
replication, and/or transcription, and/or translation. 

Sequence Variations in HCV Isolates 
30 from Different Individuals 

Isolates of HCV which contain sequences which 
deviate from CDC/HCV1 were identified in human 
individuals, some of whom were serologically positive for 
anti-C100-3 antibodies (EC10 was antibody negative). 
35 Identification of these new isolates was accomplished by 
cloning and sequencing segments of the HCV genome which 
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had been amplified by the PCR technique using CDC/HC1 
sequences. Amplification was accomplished essentially 
based on an HCV/cPCR method. The method utilizes primers 
and probes based upon the HCV cDNA sequences described 
5 herein. The first step in the method is the synthesis of 
a cDNA to either the HCV genome, or its replicative inter- 
mediate, using reverse transcriptase. After synthesis of 
the HCV cDNA, and prior to amplification, the RNA in the 
sample is degraded by techniques known in the art. A 

10 designated segment of the HCV cDNA is then amplified by 
the use of the appropriate primers. The amplified 
sequences are cloned, and clones containing the amplified 
sequences are detected by a probe which is complementary 
to a sequence lying between the primers, but which does 

15 not overlap the primers. 

HCV Isolates Isolated from Humans in the U.S. 

Blood samples which were used as a source of HCV 
virions were obtained from the American Red Cross in 

20 Charlotte, North Carolina , and from the Community Blood 

Center of Kansas, Kansas City, Missouri. The samples were 
screened for antibodies to the HCV C100-3 antigen using an 
EL ISA assay as described in E.P.O. Publication No. 
318, 216 , and subjected to supplemental Western blot 

25 analysis using a polyclonal goat anti-human HRP to measure 
anti-HCV antibodies. Two samples, #23 and #27, from the 
American Red Cross and from the Community Blood Center of 
Kansas, respectively, were determined to be HCV positive 
by these assays. 

30 Viral particles present in the serum of these 

samples were isolated by ultracentrif ugation under the 
conditions described by Bradley °t al . (1985). RNA was 
extracted from the particles by digestion with proteinase 
K and SDS at final concentrations of 10 micrograms /ml 

35 proteinase K, and 0.1% SDS; digestion was for 1 hour at 
37°C. Viral RNA was further purified by extraction with 
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chlorof orm-phenol , as described in E.P.O. Publication No. 
318,216. 

HCV RNA in the preparation of RNA was reverse 
transcribed into cDNA essentially as described in E.P.O. 
5 Publication No. 318,216, except that the oligonucleotide 
JHC 7 , which corresponds to the cDNA sequence 1958-1939, 
and which has the following sequence, was used as primer 
for the reverse transcriptase reaction. 

10 JHC 7: CCA GCG GTG GCC TGG TAT TG. 

After both strands of the cDNA were synthesized, 
the resulting cDNA was then amplified by the PCR method 
essentially as described supra, for the isolation of 

15 clones generated by PCR amplification, except that the 

oligonucleotide primers used, i.e., JHC 6 and ALX 80, were 
designed to amplify a 1080 nucleotide segment of the HCV 
genome from CDC/HCV1 nucleotides 673 to 1751. The prim- 
ers, in addition, are designed to incorporate a NOT I 

20 restriction site at the 3 '-end of the PCR product, and a 
blunt end at the 5' -terminus. The sequences of the prim- 
ers is: 



25 



30 



ALX 80: TTT GGG TAA GGT CAT CGA TAC CCT TAC GTG; 

and 

JHC 6: ATA TGC GGC CGC CTT CCG TTG GCA TAA. 



ALX 80 corresponds to nucleotides 67 3-702 of the CDC/HCV1 
sequence; JHC 6 corresponds to nucleotides 1752-1738 of 
the HCV1 (in addition there are 12 extra nucleotides which 
encode a NotI site). The designation of nucleotides in 
JHC 6, i.e., a declining number, indicates the placement 
35 in the anti-sense strand. 
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After PCR amplification with the above described 
primers , the blunt end terminus was converted into a NOT I 
site as follows, A homopolymer tail of 15 dGs was at- 
tached to the PCR product using terminal deoxynucleotide 
5 transferase, and the products were again subjected to 
amplification by PCR using as primers JHC 6 and JHC 13. 
The latter primer, JHC 13, the sequence of which follows, 
is designed to contain a NOT I site in addition to an SP6 
phage promoter. (The SP6 promoter is described in GENETIC 
10 ENGINEERING, J. Setlow Ed. (1988). 

JHC 13: AAT TCG CGG CCG CCA TAC GAT TTA GGT GAC 
ACT ATA GAA CCC CCC CCC CCC CCC. 



15 in order to clone the amplified HCV cDNA, the 

PCR products were cleaved with Notl, precipitated with 
spermine to remove free oligonucleotides (Hoopes et al. 
(1981)), and cloned into the Notl site of pUC18S (see Sec- 
tion IV.A.34.). The HCV cDNAs in three clones derived 

20 from each HCV isolate, were subjected to sequence 
analysis. Analysis was essentially by the method 
described in Chen and Seeburg (1985). 

Consensus sequences of the clones derived from 
HCV in samples 23 and 27 are shown in Fig. 46 and Fig. 47, 

25 respectively. The variable sequences are also shown in 
these figures, as are the amino acids encoded in the 
consensus sequences . 

Fig. 39 and Fig. 40 show comparisons of the 
aligned positive strand nucleotide sequences (Fig. 39) and 

30 putative amino acid sequences (Fig. 40) of samples 23, 27, 
and HCV1. The amino acid sequence of HCV1 in Fig. 39 
represents amino acid numbers 129-467 of the HCV 
polyprotein encoded by the large ORF in the HCV genomic 
RNA. An examination of Fig. 46 and Fig. 47 show that 

35 there are variations in the sequences of the three 
isolated clones. The sequence variations at the 
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nucleotide level and the amino acid level are summarized 
in the table immediately below. in the table, the 
polypeptides designated S and NS1 represent amino acid 
numbers 130 to ~380, and 380 to ~470, respectively. The 
5 numbering is from the putative initiator methionine. The 
terminology S and NS1 is based upon the positioning of the 
sequences encoding the polypeptides using the Flavivirus 
model* As discussed above , however , recent evidence sug- 
gests that there is not total correlation between HCV and 
10 the Flaviviruses with regard to viral polypeptide domains, 
particularly in the putative E/NS1 domains. Indeed, HCV 
polypeptides and their coding domains may exhibit 
substantial deviation from the Flavivirus model. 

15 Table 

Sequence Homology 



Nucleotide Encoding Amino Acid Encoded 





overall 


S 


NS1 


overall 


S 


NS1 




% 


% 


% 


% 


% 


% 


HCV1/HCV23 


93 


95 


91 


92 


95 


87 


HCV1/HCV27 


89 


93 


84 


89 


95 


82 


HCV23/HCV27 


89 


93 


85 


90 


93 


84 



25 Although there are variations in the newly 

isolated HCV sequences, the cloned sequences from samples 
23 and 27 (called HCV23 and HCV27) each contain 1019 
nucleotides, indicating a lack of deletion and addition 
mutants in this region in the selected clones . The 

30 sequences in Figs. 39 and 40 also show that the isolated 
sequences are not rearranged in this region. 

A comparison of the consensus sequences for HCV1 
and for the other isolates of HCV is summarized in the 
Table, supra. The sequence variations between the 

35 chimpanzee isolate HCV1, and the HCVs isolated from humans 
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are about the same as that seen between the HCVs of human 
origin • 

It is of interest that the sequence variations 
in two of the putative domains is not uniform. The 
5 sequence in a putative S region appears to be relatively- 
constant, and randomly scattered throughout the region. 
In contast, a putative NS1 region has a higher degree of 
variability than the overall sequence , and the variation 
appears to be in a hypervariable pocket of about 28 amino 

10 acids which is located about 70 amino acids downstream 
from the putative N-terminus of the putative polyprotein. 

Although it may be argued that the detected 
variations were introduced during the amplification proc- 
ess, it is unlikely that all of the variations are from 

15 this result. It has been estimated that Taq polymerase 
introduces errors into a sequence at approximately one 
base per 10 kilobases of DNA template per cycle (Saiki et 
al, (1988)). Based upon this estimate, up to 7 errors may 
have been introduced during the PCR amplification of the 

20 1019 bp DNA fragment. However, the three subclones of 
HCV-23 and HCV-27 yielded 29 and 14 base variations, 
respectively. The following suggest that these variations 
are naturally occurring. About 60% of the base changes 
are silent mutations which do not change the amino acid 

25 sequence. Variations introduced by the Taq polymerase 
during PCR amplification would be expected to occur 
randomly; however, the results show that the variant 
sequences are clustered in at least one specific region. 
Moreover, a consensus sequence was derived by sequencing 

30 multiple different clones derived from the PCR amplified 
products . 



HCV Isolates from Humans in 
Italy and in the U.S. 
35 Segments of HCV RNA present in different 

isolates were amplified by the HCV/cPCR method. These 
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segments span a region of ~0.6Kb to ~ 1.6Kb downstream from 
the methionine encoding start codon of the putative HCV 
polyprotein. The isolates are from biological specimens 
obtained from HCV infected individuals. More 
5 specifically, isolate HCT #18 is from human plasma from an 
individual in the U.S.A. , EC1 and EC10 are from a liver 
biopsy of an Italian patient, and Th is from a peripheral 
blood mononucleocyte fraction of an American patient. 
Comparable segments of HCV RNA have been isolated from a 

10 chimpanzee . 

RNA was extracted from the human plasma 
specimens using phenol :CHC13:isoamyl alcohol extraction. 
Either 0.1 ml or 0.01 ml of plasma was diluted to a final 
volume of 1.0 ml, with a TENB/proteinase K/SDS solution 

15 (0.05 M Tris-HCL, pH 8.0, 0.001 M EDTA, 0.1 M NaCl, 1 mg/ 
ml Proteinase K, and 0.5% SDS) containing 10 to 40 
micrograms /ml polyadenylic acid, and incubated at 37°C for 
60 minutes. After this proteinase K digestion, the 
resultant plasma fractions were deproteinized by extrac- 

20 tion with TE (50 mM Tris-HCl, pH 8.0, 1 mM EDTA) saturated 
phenol, pH 6.5. The phenol phase was separated by 
centrifugation, and was reextracted with TENB containing 
0.1% SDS. The resulting aqueous phases from each extrac- 
tion were pooled, and extracted twice with an equal volume 

25 of phenol/chloroform/isoamyl alcohol [1:1(99:1)], and then 
twice with an equal volume of a 99:1 mixture of 
chloroform/isoamyl alcohol. Following phase separation by 
centrifugation, the aqueous, phase was brought to a final 
concentration of 0.2 M Na Acetate, and the nucleic acids 

30 were precipitated by the addition of two volumes of 

ethanol. The precipitated nucleic acids were recovered by 
ultracentrifugation in a SW 41 rotor at 38 K, for 60 
minutes at 4°C, or in a microfuge for 10 minutes at 10K, 
4°C. 
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RNA extracted from the liver biopsy was provided 
by Dr. F. Bonino f Ospedale Maggiore di S. Giovanni 
Battista, Torino, Italy. 

The mononucleocyte fraction was obtained by 
5 sedimentation of the individual's aliquot of blood through 
Ficoll-Pagues (Pharmacia Corp), using the manufacturer's 
directions. Total RNA was extracted from the fraction 
using the guanidinium thiocyanate procedure described in 
E.P.O. Publication No. 318,216 (See also Choo et al 
10 (1989)). 

Synthesis of HCV cDNA from the samples was ac- 
complished using reverse transcriptase, and primers 
derived from clone 156e and from clone K91. These prim- 
ers, which are anti-sense relative to the genomic RNA, 
15 have the following sequences. 

156el6B: 5' CGA CAA GAA AGA CAG A3', 

and 

K91/16B 5' CGT TGG CAT AAC TGA T 3'. 

20 

Following ethanol precipitation, the 
precipitated RNA or nucleic acid fraction was dried, and 
resuspended in DEPC treated distilled water. Secondary 
structures in the nucleic acids were disrupted by heating 

25 at 65°C for 10 minutes, and the samples were immediately 
cooled on ice. cDNA was synthesized using 1 to 3 micro- 
grams of total RNA from liver, or from nucleic acids (or 
RNA) extracted from 10 to 100 microliters of plasma. The 
synthesis utilized reverse transcriptase, and was in a 25 

30 microliter reaction, using the protocol specified by the 
manufacturer, BRL . All reaction mixtures for cDNA 
synthesis contained 23 units of the RNAase inhibitor, 
RNASIN™ (Fisher/Promega) . Following cDNA synthesis, the 
reaction mixtures were diluted with water, boiled for 10 

35 minutes, and quickly chilled on ice. 
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Each set of samples was subjected to two rounds 
of PCR amplification. The primers for the reactions were 
selected to amplify regions designated "EnvL" and EnvR" . 
The "EnvL" region encompasses nucleotides 669-1243 f and 
5 putative amino acids 117 to 308; the "EnvR" region en- 
compasses nucleotides 1215-1629 f and encodes putative 
amino acids 300-408 (the putative amino acids are numbered 
starting from the putative methionine initiation codon) . 
The relationship of these regions relative to the putative 
10 polyprotein encoded in the HCV cDNA, and to the 

polypeptides encoded in the Flavirus model is shown in 
Fig. 48. 

The primers for the first round of PCR reactions 
were derived from the HCV cDNA sequences in either clone 

15 ag30a, clone 156e f or clone k9-l. The primers used for 
the amplification of the EnvL region were 156el6B (shown 
supra), and ag30al6A for the sense strand; the amplifica- 
tion of the EnvR region utilized the primer K91/16B (shown 
supra), and 156el6a for the sense strand. The sequences of 

20 the sense strand primers are the following. 

For EnvL, ag30al6A: 5' CTC TAT GGC AAT GAG G 3' , 



25 



and 



For EnvR, 156el6A: 



AGC TTC GAC GTC ACA T 3' 



The PCR reactions, were performed essentially 
according to the manufacturer's directions (Cetus-Perkin- 

30 Elmer), except for the addition of 1 microgram of RNase A. 
The reactions were carried out in a final volume of 100 
microliters. The PCR was performed for 30 cycles, utiliz- 
ing a regimen of 94°C (1 min) f 37°C (2 min) f and 72°C (3 
min), with a 7 minute extension at 72°C for the last 

35 cycle. The samples were then extracted with phenol : CHC1 3 r 
ethanol precipitated two times, resuspended in 10 mM Tris 
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HCl, pH 8.0/ and concentrated using Centricon-30 (Amicon) 
filtration. This procedure efficiently removes 
oligonucleotides less than 30 nucleotides in size; thus, 
the primers from the first round of PCR amplification are 
5 removed . 

The Centricbn-30 concentrated samples were then 
subjected to a second round of PCR amplification using 
probes designed from clones 202a and 156e for the EnvL 
region, and from 156e and 59a for the EnvR region. The 
10 primers for amplification of the EnvL region have the fol- 
lowing sequences. 

202aEnv41a: 5' CTT GAA TTC GCA ATT TGG GTA 

AGG TCA TCG ATA CCC TTA CG 3' 



15 



and 



156e38B r : 5' CTT GAA TTC GAT AGA GCA ATT 

GCA ACC TTG CGT CGT CC 3 ' . 

20 

The primers for amplification of the EnvR region in RNAs 
derived from humans have the following sequences. 

156e38A': 5' CTT GAA TTC GGA CGA CGC AAG 

25 GTT GCA ATT GCT CTA TC 3' 



and 



59aEnv39C: 5' CTT GAA TTC CAG CCG GTG TTG 

30 AGG CTA TCA TTG CAG TTC 3' . 

Amplification by PCR was for 35 cycles utilizing a regimen 

of 94°C (1 min), 60°C (1 min), and 72°C (2 min), with a 7 

minute extension at 72°C for the last cycle. The samples 

35 were then extracted with phenol :CHC1 3/ precipitated two 

times, and digested with EcoRI . The PCR reaction products 
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were analyzed by separation of the products by 
electrophoresis on 6% polyacrylamide gels. DNA of ap- 
proximately the estimated size of the expected PCR product 
was electroeluted from the gels f and subcloned into either 
5 a pGEM-4 plasmid vector or into lambda gtll. The expected 
product sizes for the EnvL and EnvR after the first round 
of amplification are 615 bp and 683 bp, respectively; 
after the second round of amplification the expected 
product sizes for EnvL and EnvR are 414 bp and 575 bp, 

10 respectively. The plasmids containing the amplified 
products were used to transform host cells; the pGEM-4 
plasmid was used to transform DH5 -alpha, and lambda gtll 
was used to transform C600 delta-HFL. Clones of the 
transformed cells which either hybridized to the appropri- 

15 ate HCV probes (described below), or those which had 

inserts of the correct size were selected. The inserts 

were then cloned in M13 and sequenced. 

The probes for all of the HCV/cPCR products 
32 

consisted of P labeled sections of HCV cDNA which had 
20 been prepared by PCR amplification of a region of clone 

216 (using CA216al6A and 216al6B as primers), and of clone 
84 (using CA84al6A and CA84al6B or CA84al6C as primers); 
32 P was introduced into the PCR products by nick transla- 
tion. The probes for the first and second round of EnvL 
25 amplification were from clone 216. Those for the first 
round of EnvR amplification were from 84 (i.e., CA84al6A 
and CA84al6B), for the second round of EnvL amplification 
were CA84al6A and CA84al6C. These probes did not overlap 
the primers used in the HCV/cPCR reactions. The sequence 
30 of the primers for the PCR amplification of the probes is 
in the following table. 



35 
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Table 



10 



Primer Clone Sequence 

CA216al6A 216 5' TGA ACT ATG CAA CAG G 3 

CA216al6B 216 5' GGA GTG TGC AGG ATG G 3' 

CA84al6A 84 5' AAG GTT GCA ATT GCT C 3' 

CA84al6B 84 5' ACT AAC AGG ACC TTC G 3' 

CA84al6C 84 5' TAA CGG GTC ACC GCA T 3' 



Sequence information on variants in the EnvL 
region was obtained from 3 clones from HCT #18 , 2 clones 
from TH, 3 clones from EC1, and from the HCV1 clones 
described in E.P.O. Publication No. 318 , 216 , and supra. A 

15 comparison of the composite nucleotide sequence of each 

isolate derived from these clones is shown in Fig. 49. In 
the figure, each sequence is shown 5' to 3' for the sense 
strand for the EnvL region, and the sequences have been 
aligned. The vertical lines and capital letters indicate 

20 sequence homology, the absence of a line and an 

uncapitalized letter indicates a lack of homology. The 
sequences shown in the lines are as follows: line 1, 
Thorn; line 2, EC1; line 3, HCT #18; line 4, HCV1 . 

Sequence information on variants in the EnvR 

25 region was obtained from two clones of EC10, and from the 
HCV1 clones described in E.P.O. Publication No. 318,216 
and supra.. The two EC10 clones differed by only one 
nucleotide. A comparison of the nucleotide sequences of 
EC10( clone 2) and a composite of the HCV1 sequences is 

30 shown in Fig. 50; each sequence is shown 5' to 3' for the 
sense strand of the EnvR region, and the sequences have 
been aligned. The double dots between the sequences 
indicate sequence homology. 

A comparison of the amino acid sequences encoded 

35 in the EnvL (amino acids #117-308) and EnvR region (amino 
acids #300-438) for each of the isolates is shown in Fig. 



WO 90/14436 PCIYUS90/02853 

-84- 

51 and Fig. 52, respectively. Included in the Figures are 
sequences for the isolates JH23 and JH27 f described supra. 
Also indicated are sequences from a Japanese isolate; 
these sequences were provided by Dr. T. Miyamura f Japan. 
5 In the figures, the amino acid sequence for the region is 
given in its entirety for HCV1, and the non-homologous 
amino acids in the various isolates are indicated. 

As seen in Fig. 51, In the EnvL region there is 
overall about a 93% homology between HCV1 and the other 

10 isolates. HCT18, Th, and EC1 have about a 97% homology 
with HCV1; JH23 and JH27 have about 96% and about 95% 
homology, respectively, with HCV1. Fig. 52 shows that the 
homologies in the EnvR region are significantly less than 
in the EnvL region; moreover, one subregion appears to be 

15 hypervariable (i.e., from amino acid 383-405). This. data 
is summarized in the Table immediately below. 

Table 

Homology of EnvR Region 

20 

Isolate Percent Homology with HCV1 

AA330-AA438 AA383-AA405 

JH23(U.S.) 83 57 

JH27(U.S.) 80 39 

25 Japanese 73 48 

EC10 (Italy) 84 48 

Detection of Positive and Negative Strand 
5'-HCV RNA in Serum 
30 The RNA in HCV27, isolated from serum, was 

analyzed for the presence of positive and negative strands 
using the PCR method. The PCR method was performed es- 
sentially as described above, except for the following. 
The extracted HCV27 RNA was reverse transcribed into 
35 single-stranded cDNA using as a primer either Alex90 or 
JH52 (see supra, for the sequences). The sequence of 
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Alex90 matches that in nucleotides -312 to -283 of the 
positive strand of HCV RNA, whereas JH52 matches that of 
nucleotides -117 to -93 of the negative strand. the 
resulting single-stranded HCV cDNAs were each separately 
5 amplified by PCR using Alex90 and JH52. Detection of the 
amplified products was accomplished by Southern blotting, 
using Alex89 as the probe. Alex89 matches nucleotide 
numbers -203 to -175 of HCV RNA. The sequence of Alex89 
is: 

10 

5 ' CCA TAG TGG TCT GCG GAA CCG GTG AGT ACA 3 ' . 

The analysis indicated that f by this method, the signals 
of the amplified products of both RNA strands were of 
15 equal intensity. These results are suggestive that HCV 
RNA in the 5 '-region may exist as double-stranded RNA. 

Probes for Sandwich Hybridization for HCV 

This example exemplifies the sets of label and 

20 capture probes useful to detect HCV RNA in biological 
samples, using essentially the assay described in U.S. 
Patent No. 4,868,105. The method is a solution-phase 
sandwich hybridization assay which utilizes both capture 
and label probes which hybridize to target sequences in an 

25 analyte nucleic acid. In the screening of biological 

samples for HCV, the probes used bind to conserved regions 
of the HCV genome, and the HCV binding regions are 
selected for their uniqueness to the HCV genome. The 
regions which bind to the binding partner of the capture 

30 probe, or the portion of the label probe which binds to 
the labeling moiety (or to an amplifying multimer if the 
method described in E.P-0. Publication No. 317,077 is 
used), are selected such that they do not bind to any of 
the known sequences in the databank or in HCV f and which 

35 have the appropriate content of Gs and Cs to allow stable 
duplex formation with their complements under the selec- 
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tion conditions. The capture and label probes are in 
sets, and the probes of one set do not intersperse with 
the probes of another set. These probes are comprised of 
sequences which are complementary to the following 
5 nucleotide sequences in the coding strand of the proto- 
type HCV cDNA sequence shown in Fig. 18. 

Set 1 



10 


Probe type 


Probe Number 


Complement of 
Nucleotide Numbers 




Capture 


42.XT1.1 


-318 


to 


-289 




Capture 


42.XT1.2 


-285 


to 


-256 




Capture 


42.XT1.3 


-252 


to 


-223 


15 


Capture 


42.XT1.4 


-219 


to 


-190 




Label 


42.LLA2C.5 


-186 


to 


-157 




Label 


42.LLA2C.6 


-153 


to 


-124 




Label 


42.LLA2C7 


-120 


to 


-91 




Label 


42 .LLA2C.8 


-87 


to 


-58 


20 


Label 


42.LLA2C.9 


-54 


to 


-25 




Label 


42.LLA2C. 10 


-21 


to 


9 




Label 


42.LLA2C.il 


13 


to 


42 




Label 


42.LLA2C.12 


46 


to 


75 




Label 


42.LLA2C.13 


79 


to 


108 


25 


Label 


42.LLA2C. 14 


112 


to 


141 




Label 


42.LLA2C.15 


145 


to 


174 



30 



35 
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Set 2 



Probe type 



Probe Number 



Complement of 
Nucleotide Numbers 



Capture 


42 


.16.XT1 


4378 


to 


4407 


Capture 


42 


.17.XT1 


4411 


to 


4440 


Capture 


42 


.18.XT1 


4444 


to 


4473 


Capture 


42 


.19.XT1 


4477 


to 


4506 


Capture 


42 


.20.XT1 


4510 


to 


4539 


Label 


42 


.21.LLA2C 


4543 


to 


4572 


Label 


42 


.22.LLA2C 


4576 


to 


4605 


Label 


42 


.23.LLA2C 


4609 


to 


4638 


Label 


42 


.24.LLA2C 


4642 


to 


4671 


Label 


42 


.25.LLA2C 


4675 


to 


4704 


Label 


42 


.26.LLA2C 


4708 


to 


4737 


Label 


42 


.27 .LLA2C 


4771 


to 


4770 


Label 


42 


.28.LLA2C 


4774 


to 


4803 


Label 


42 


.29.LLA2C 


4807 


to 


4836 


Label 


42 


.30.LLA2C 


4840 


to 


4869 


Label 


42 


•31.LLA2C 


4873 


to 


4902 



25 



30 



35 
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Set 3 



5 Probe type Probe Number Complement of 

Nucleotide Numbers 



Capture 


42 


.32.XT1 


4056 


to 


4085 


Capture 


42 


.33.XT1 


4089 


to 


4085 


Capture 


42 


.34.XT1 


4122 


to 


4151 


Capture 


42 


.35.XT1 


4155 


to 


4184 


Label 


42 


•36.LLA2C 


4188 


to 


4217 


Label 


42 


.37.LLA2C 


4221 


to 


4250 


Label 


42 


.38.LLA2C 


4254 


to 


4283 


Label 


42 


.39.LLA2C 


4287 


to 


4316 


Label 


42 


.40.LLA2C 


4230 


to 


4349 


Label 


42 


•41.LLA2C 


4353 


to 


4382 


Label 


42 


.42.LLA2C 


4386 


to 


4415 


Label 


42 


.43.LLA2C 


4419 


to 


4448 



20 In the above sets, each capture probe contains , in addi- 
tion to the sequences complementary to the HCV sequences, 
the following sequence downstream of the HCV sequence 
(i.e., at the 3 '-end): 

25 5 ' CTT CTT TGG AGA AAG TGG TG 3 ' . 

The sequence common to each capture probe is complementary 
to a sequence in the binding partner(s), so that after 
hybridization, the duplex can be captured via affixation 
30 to the solid phase. 

Also, in each set, each label probe contains, in 
addition to the sequences complementary to the HCV 
sequences, the following sequence downstream of the HCV 
sequence : 

35 

5 ' TTA GGC ATA GGA CCC GTG TC 3'. 
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If the method described in E.P.O. Publication No. 317,077 
is used, the sequence common to each label probe is com- 
plementary to a sequence in a mul timer, to allow hybrid 
5 duplex formation with that multimer. 

The sequences of the probes in the above sets 
are shown in Fig. 19. 

Detection of HCV Polynucleotide Sequences 

10 Using PCR Amplification 

In the generalized method for amplification of 
HCV RNA by cPCR it is contemplated that the RNA strand is 
a virion or mRNA strand, which is a "sense" strand. 
However, it is also possible that replicative intermediate 

15 forms may also be detected which would be "anti-sense"; in 
this case the primer would be "sense". An RNA sense strand 
containing the target region is hybridized with an anti- 
sense primer which primes the synthesis of the replicate 
strand containing the target. cDNA to the RNA template is 

20 synthesized with a primer- and template-dependent reverse 
transcriptase. The cDNA in the resulting RNA:cDNA hybrid 
is released by denaturation and treatment with RNAse. 
Primers are annealed to the cDNA, and extended with a 
primer- and template-dependent DNA polymerase. The 

25 products are denatured, re-annealed to primers, and a 
second round of synthesis is conducted. A number of 
cycles are run until the amplified product containing the 
target region is in a desired amount, which is at least a 
detectable level. 

30 

Detection of Amplified HCV Nucleic Acid Sequences 
derived from HCV Nucleic Acid Sequences in Liver and 
Plasma Specimens from Chimpanzees with NANBH 

HCV nucleic acids present in liver and plasma of 
35 chimpanzees with NANBH, and not in control chimpanzees, 
were amplified using essentially the polymerase chain re- 
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action (PCR) technique described by Saiki et al. (1986). 
The primer oligonucleotides were derived from the HCV cDNA 
sequences in clone 81 (Fig- 22) , or clones 36 (Fig. 23) 
and 37b (Fig. 24). The amplified sequences were detected 
5 by gel electrophoresis and a modified Southern blotting 
method, using as probes the appropriate cDNA oligomer or 
nick- translated cDNA sequence with a sequence from the 
region between, but not including, the two primers. 

Samples of RNA containing HCV sequences to be 

10 examined by the amplification system were isolated from 
liver biopsies of three chimpanzees with NANBH, and from 
two control chimpanzees. The isolation of the poly A + RNA 
fraction was by the guanidinium thiocyanate procedure 
described in Maniatis et al. (1982). 

15 Samples of RNA which were to be examined by the 

amplification system were also isolated from the plasmas 
of two chimpanzees with NANBH, and from one control 
chimpanzee, as well as from a pool of plasmas from control 
chimpanzees. One infected chimpanzee had a titer equal to 

20 or greater than 10 6 CID/ml, and the other infected 

chimpanzee had a titer equal to or greater than 10 5 CID/ 
ml. 

The nucleic acids were extracted from the plasma 
as follows. Either 0.1 ml or 0.01 ml of plasma was 

25 diluted to a final volume of 1.0 ml, with a TENB/ 

proteinase K/SDS solution (0.05 M Tris-HCL, pH 8.0, 0.001 
M EDTA, 0.1 M NaCl, 1 mg/ml Proteinase K, and 0.5% SDS) 
containing 10 micrograms /ml polyadenylic acid, and 
incubated at 37°C for 60 minutes. After this proteinase K 

30 digestion, the resultant plasma fractions were 

deproteinized by extraction with TE (10.0 mM Tris-HCl, pH 
8.0 f 1 mM EDTA) saturated phenol. The phenol phase was 
separated by centrif ugation, and was reextracted with TENB 
containing 0.1% SDS. The resulting aqueous phases from 

35 each extraction were pooled, and extracted twice with an 
equal volume of phenol/chloroform/isoamyl alcohol 
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[1:1(99:1)], and then twice with an equal volume of a 99:1 
mixture of chloroform/ isoamyl alcohol. Following phase 
separation by centrif ugation, the aqueous phase was 
brought to a final concentration of 0.2 M Na Acetate, and 
5 the nucleic acids were precipitated by the addition of two 
volumes of ethanol. The precipitated nucleic acids were 
recovered by ultracentrifugation in a SW 41 rotor at 38 K, 
for 60 minutes at 4°C. 

In addition to the above, the high titer 

10 chimpanzee plasma and the pooled control plasma 

alternatively were extracted with 50 micrograms of poly A 
carrier by the procedure of Chomcyzski and Sacchi (1987), 
This procedure uses an acid guanidinium thiocyanate 
extraction. RNA was recovered by centrif ugation at 10,000 

15 RPM for 10 minutes at 4°C in an Eppendorf microfuge. 

On two occasions, prior to the synthesis of cDNA 
in the PCR reaction, the nucleic acids extracted from 
plasma by the proteinase K/SDS/phenol method were further 
purified by binding to and elution from S arid S Elutip-R 

20 Columns. The procedure followed was according to the 
manufacturer's directions. 

The cDNA used as a template for the PCR reaction 
was derived from the nucleic acids (either total nucleic 
acids or RNA) prepared as described above. Following 

25 ethanol precipitation, the precipitated nucleic acids were 
dried, and resuspended in DEPC treated distilled water. 
Secondary structures in the nucleic acids were disrupted 
by heating at 65°C for 10 minutes, and the samples were 
immediately cooled on ice. cDNA was synthesized using 1 

30 to 3 micrograms of total chimpanzee RNA from liver, or 
from nucleic acids (or RNA) extracted from 10 to 100 
microliters of plasma. The synthesis utilized reverse 
transcriptase, and was in a 25 microliter reaction, using 
the protocol specified by the manufacturer, BRL. The 

35 primers for cDNA synthesis were those also utilized in the 
PCR reaction, described below. All reaction mixtures for 
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cDNA synthesis contained 23 units of the RNAase inhibitor, 
RNASIN 1 " ( Fisher /Promega ) . Following cDNA synthesis, the 
reaction mixtures were diluted with water, boiled for 10 
minutes, and quickly chilled on ice. 
5 The PCR reactions were performed essentially 

according to the manufacturer's directions (Cetus-Perkin- 
Elmer), except for the addition of 1 microgram of RNase A. 
The reactions were carried out in a final volume of 100 
microliters. The PCR was performed for 35 cycles, utiliz- 
10 ing a regimen of 37°C (2 min) , 72°C (3 min), and 94°C (1 
min) . 

The primers for cDNA synthesis and for the PCR 
reactions were derived from the HCV cDNA sequences in 
either clone 81, clone 36, or clone 37b, (The HCV cDNA 
15 sequences of clones 81, 36, and 37b are shown in Figs. 22 , 
23, and 24, respectively.) The sequences of the two 16- 
mer primers derived from clone 81 were: 

5 f CAA TCA TAC CTG ACA G 3' 
20 and 

5 ' GAT AAC CTC TGC CTG A3'. 

The sequence of the primer from clone 36 was: 

25 5" GCA TGT CAT GAT GTA T 3'. 

The sequence of the primer from clone 37b was: 

5 ' ACA ATA CGT GTG TCA C 3 ' . 

In the PCR reactions, the primer pairs consisted of either 
the two 16-mers derived from clone 81, or the 16-mer from 
clone 36 and the 16-mer from clone 37b. 

The PCR reaction products were analyzed by 
separation of the products by alkaline gel 

electrophoresis, followed by Southern blotting, and detec- 



WO 90/14436 PCT/US90/02853 

-93- 

32 

tion of the amplified HCV-cDNA sequences with a P- 
labeled internal oligonucleotide probe derived from a 
region of the HCV cDNA which does not overlap the primers. 
The PCR reaction mixtures were extracted with phenol/ 
5 chloroform, and the nucleic acids precipitated from the 
aqueous phase with salt and ethanol. The precipitated 
nucleic acids were collected by centrifugation, and dis- 
solved in distilled water. Aliquots of the samples were 
subjected to electrophoresis on 1.8% alkaline agarose 

10 gels. Single stranded DNA of 60 , 108 f and 161 nucleotide 
lengths were co-electrophoresed on the gels as molecular 
weight markers. After electrophoresis, the DNAs in the 
gel were transferred onto Biorad Zeta Probe* paper. 
Prehybridization and hybridization, and wash conditions 

15 were those specified by the manufacturer (Biorad). 

The probes used for the hybridization-detection 
of amplified HCV cDNA sequences were the following. When 
the pair of PCR primers were derived from clone 81, the 
probe was an 108-mer with a sequence corresponding to that 

20 which is located in the region between the sequences of 
the two primers. When the pair of PCR primers were 
derived from clones 36 and 37b, the probe was the nick- 
translated HCV cDNA insert derived from clone 35, the 
nucleotide sequence of which is shown in Fig. 34. The 

25 primers are derived from nucleotides 155-170 of the clone 
37b insert, and 206-268 of the clone 36 insert. The 3'- 
end of the HCV cDNA insert in clone 35 overlaps 
nucleotides 1-186 of the insert in clone 36; and the 5'- 
end of clone 35 insert overlaps nucleotides 207-269 of the 

30 insert in clone 37b. (Compare Figs. 23, 34 and 24.) Thus, 
the cDNA insert in clone 35 spans part of the region 
between the sequences of the clone 36 and 37b derived 
primers, and is useful as a probe for the amplified 
sequences which include these primers. 

35 Analysis of the RNA from the liver specimens was 

according to the above procedure utilizing both sets of 
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primers and probes. The RNA from the liver of the three 
chimpanzees with NANBH yielded positive hybridization 
results for amplification sequences of the expected size 
(161 and 586 nucleotides for 81 and 36 and 37b, 
5 respectively) , while the control chimpanzees yielded 
negative hybridization results . The same results were 
achieved when the experiment was repeated three times. 



plasma was also according to the above procedure utilizing 
10 the primers and probe from clone 81. The plasmas were 

from two chimpanzees with NANBH , from a control 

chimpanzee, and pooled plasmas from control chimpanzees. 

Both of the NANBH plasmas contained nucleic acids /RNA 

which yielded positive results in the PCR amplified assay, 
15 while both of the control plasmas yielded negative 

results . These results have been repeatedly obtained 

several times . 



RNA viruses. By using PCR technology it is possible to 

20 design primers to amplify sequences of the HCV genome. By 
analysis of the amplified products, it is expected to be 
able to identify both defective versions of the viral 
genome as well as wild-type viral species. Accordingly, 
using two primers based on known HCV sequence, one can 

25 predict accurately the expected size of the PCR product. 
Any larger species observed by gel electrophoresis and 
hybridization analysis could represent potential variant 
genomes. Alternatively, any smaller species observed in 
this fashion might represent defective agents . Analyses 

30 of these types would be useful in confirming the exact 
origin of the known HCV sequence, whether it is indeed a 
wild-type viral sequence or a defective genome. 
Techniques and methods for these analyses are well known 
in the art and have been previously described. This 

35 methodology will enable one skilled in the art to obtain 



Analysis of the nucleic acids and RNA from 



Defective viruses have been known to occur in 
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related (wild-type or defective) forms of the viral 
genome . 

Detection of Sequences in Captured Particles 
5 Which When Amplified by PCR 

Hybridize to HCV cDNA Derived from Clone 81 

The RNA in captured particles was obtained as 
described below. The analysis for sequences which hybrid- 
ize to the HCV cDNA derived from clone 81 was carried out 

10 utilizing the PCR amplification procedure, as described 
supra . , except that the hybridization probe was a kinased 
oligonucleotide derived from the clone 81 cDNA sequence. 
The results showed that the amplified sequences hybridized 
with the HCV cDNA probe. 

15 Particles were captured from HCV infected 

chimpanzee plasma using polystyrene beads coated with an 
immunopurif ied antibody directed against the polypeptide 
encoded in clone 5-1-1. The procedure for producing the 
immunopurif ied antibody preparation is described in E.P.O. 

20 Publication No. 318,216, which is commonly owned by the 
herein assignee, and which is incorporated herein by 
reference. Briefly, the HCV polypeptide encoded within 
clone 5-1-1 was expressed as a fusion polypeptide with 
superoxide dismutase (SOD). This was accomplished by 

25 subcloning the clone 5-1-1 cDNA insert into the expression 
vector pSODcfl (Steimer et al. (1986)). DNA isolated from 
pSODcfl was treated with BamHI and EcoRI, and the follow- 
ing linker was ligated into the linear DNA created by the 
restriction enzymes: 

30 

5' GAT CCT GGA ATT CTG ATA AGA 

CCT TAA GAC TAT TTT AA 3' 



35 



After cloning, the plasmid containing the insert was 
isolated. Plasmid containing the insert was restricted 
with EcoRI. The HCV cDNA insert in clone 5-1-1 was 



# 
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excised with EcoRI, and ligated into this EcoRI linearized 
plasmid DNA. The DNA mixture was used to transform E. 
coli strain D1210 (Sadler et al. (1980)). Recombinants 
with the 5-1-1 cDNA in the correct orientation for expres- 
5 sion of the ORF were identified by restriction mapping and 
nucleotide sequencing. Recombinant bacteria from one 
clone were induced to express the SOD-NANB^j^ 
polypeptide by growing the bacteria in the presence of 
IPTG. The fusion polypeptide was purified from the re- 

10 combinant E. coli by differential extraction of the cell 
extracts with urea, followed by chromatography on anion 
and cation exchange columns. The purified SOD-NANB^j^ 
polypeptide was attached to a nitrocellulose membrane. 
Antibody in samples of HCV infected serum was absorbed to 

15 the matrix-bound polypeptide. After washing to remove 
non-specif ically bound materials and unbound materials , 
the bound antibody was released from the bound 
polypeptide . 

20 cPCR Method to Detect HCV RNA in Liver 



of the PCR assay, i.e., a cPCR assay, for detecting HCV 
infection was determined by performing the assay on total 

25 liver RNA and on serum from infected individuals. In the 
cPCR assay, putative viral RNA in the sample is reverse 
transcribed into cDNA with reverse transcriptase; a seg- 
ment of the resulting cDNA is then amplified utilizing a 
modified version of the PCR technique described by Saiki 

30 et al. (1986). The primers for the cPCR technique are 

derived from HCV RNA, which can be identified by the fam- 
ily of HCV cDNAs provided herein. Amplified product cor- 
responding to the HCV-RNA is detected utilizing a probe 
derived from the family of HCV cDNAs provided herein . 

35 The cPCR/HCV assay used in these studies were 

performed utilizing the following methods for the prepara- 



and in Serum from Individuals with NANBH . 

The reliability and utility of a modified form 
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tion of RNA f the reverse transcription of the RNA into 
cDNA, the amplification of specific segments of the cDNA 
by PCR, and the analysis of the PCR products. 

RNA was extracted from liver utilizing the 
5 guanidium isothiocyanate method for preparing total RNA 
described in Maniatis et al. (1982). 

In order to isolate total RNA from plasma, the 
plasma was diluted five- to ten- fold with TENB (0.1 M 
NaCl, 50 mM Tris-HCl, pH 8.0, 1 mM EDTA) and incubated in 

10 a Proteinase K/SDS solution (0.5% SDS, 1 mg/ml Proteinase 
K, 20 micrograms /ml Poly A carrier) for 60 to 90 minutes 
at 37°C. The samples were extracted once with phenol (pH 
6.5), the resulting organic phase was re-extracted once 
with TENB containing 0.1% SDS, and the aqueous phases of 

15 both extractions were pooled and extracted twice with an 
equal volume of phenol/CHCl 3 /isoamyl alcohol [1:1(99:1)]. 
The resulting aqueous phases were extracted with an equal 
volume of ChCl 3 /isoamyl alcohol (99:1) twice, and ethanol 
precipitated using 0.2 M sodium acetate, pH 6.5, and 2.5 

20 volumes of 100% ethanol; precipitation was overnight at - 
20°C. 

The cDNA used as a template for the PCR reaction 
was prepared utilizing the designated samples for prepara- 
tion of the corresponding cDNAs. Each RNA sample 

25 (containing either 2 micrograms of heat denatured total 

chimpanzee liver RNA, RNA from 2 microliters of plasma, or 
10% of the RNA extracted from 10mm X 4 mm cylindrical hu- 
man liver biopsies) was incubated in a 25 microliter re- 
action containing 1 micromolar of each primer , 1 

30 millimolar of each deoxyribonucleotide triphosphate 
(dNTP), 50 millimolar Tris-HCL, pH 8.3, 5 millimolar 
MgCl 2 , 5 millimolar dithiothreitol (DTT), 73 millimolar 
KC1, 40 units of RNase inhibitor ( RNASIN ) , and 5 units of 
AMV reverse transcriptase. The incubation was for 60 

35 minutes at 37°C. Following cDNA synthesis, the reactions 
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were diluted with 50 microliters of deionized water (DIW), 
boiled for 10 minutes, and cooled on ice. 

Amplification of a segment of the HCV cDNA was 
performed utilizing two synthetic oligomer 16-mer primers 
5 whbse sequences were derived from HCV cDNA clones 36 

(anti-sense) and 37b (sense). The sequence of the primer 
from clone 36 was: 

5' GCA TGT CAT GAT GTA T 3'. 

10 

The sequence of the primer from clone 37b was: 
5 ' ACA ATA CGT GTG TCA C 3', 

15 The primers were used at a final concentration of 1 

micromolar each. In order to amplify the segment of HCV 
cDNA which is flanked by the primers, the cDNA samples 
were incubated with 0.1 microgram of RNAse A and the PCR 
reactants of the Perkin Elmer Cetus PCR kit (N801-0043 or 

20 N801-0055) according to the manufacturer's instructions. 
The PCR reaction was performed for either 30 cycles or 60 
cycles in a Perkin Elmer Cetus DNA thermal cycler. Each 
cycle consisted of a 1 minute denaturation step at 94°C f 
an annealing step of 2 minutes at 37°C, and an extension 

25 step of 3 minutes at 72°C. However, the extension step in 
the final cycle (30 or 60) was 7 minutes rather than 3 
minutes. After amplification the samples were extracted 
with an equal volume of phenol: chloroform (1:1), followed 
by extraction with an equal volume of chloroform, and then 

30 the samples were precipitated with ethanol containing 0.2 
M sodium acetate. 

The cPCR products were analyzed as follows. The 
products were subjected to electrophoresis on 1.8% 
alkaline agarose gels according to Murakawa et al. (1988), 

35 and transferred onto Zeta" Probe paper (BioRad Corp.) by 
blotting gels overnight in 0.4 M NaOH. The blots were 
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neutralized in 2 X SSC (1 X SSC contains 0.15 M NaCl, 

0.015 M sodium citrate), prehybridized in 0.3 M NaCl, 15 

mM sodium phosphage buffer, pH 6.8, 15 mM EDTA, 1.0% SDS, 

0.5% nonfat milk (Carnation Co.), and 0.5 mg/ml sonicated 

5 denatured salmon sperm DNA. The blots to be analyzed for 

32 

HCV cDNA fragments were hybridized to a P-labeled probe 
generated by nick translation of the HCV cDNA insert 
sequence in clone 35, described in E.P.O. Publication No. 
318,216. After hybridization, the blots were washed in 

10 0.1 X SSC (1 X SSC contains 0.15M NaCl, 0.0 1M Na citrate) 
at 65°C, dried, and autoradiographed . The expected 
product size is 586 nucleotides in length; products which 
hybridized with the probe and migrated in the gels in this 
size range were scored as positive for viral RNA. 

15 As a control, cPCR primers designed to amplify 

alpha-1 anti-trypsin mRNA was performed to verify the 
presence of RNA in each sample analyzed. The coding 
region of the alpha-1 anti-trypsin gene is described in 
Rosenberg et al. (1984). Synthetic oligomer 16-mer prim- 

20 ers designed to amplify a 365 nucleotide fragment of the 

coding region of the alpha-1 antitrypsin gene were derived 

from nucleotides 22-37 (sense) and nucleotides 372-387 

32 

(antisense). The PCR products were detected using a P 
nick-translated probe which lies between, and not includ- 
25 ing, the cDNA/PCR primer sequences. 



action, all samples were run a minimum of three times. 
All false positive signals were eliminated when the fol- 
lowing precautions were taken: 1) eliminating aerosols by 
30 using screw capped tubes with rubber O-ring seals; 2) 
pipetting with Ranin Microman positive displacement 
pipetters with disposable pistons/capillaries; and 3) 
selecting the oligonucleotide sequences for the cDNA and 
PCR primers from two non-contiguous cDNA clones. 



Due to the extreme sensitivity of the PCR re- 
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Detection of HCV RNA in Liver Samples by a cPCR Method 
The cPCR assay was performed on total RNA 
isolated from livers of three chimpanzees experimentally 
5 infected with a NANBH agent , and from liver biopsies of 
Italian patients diagnosed as having chronic NANBH, 

Fig. 25A shows the results of the cPCR assay 
using 1 microgram of each preparation of total liver RNA. 
The RNA was isolated from liver samples of a chimpanzee in 
10 the chronic phase of NANBH (910) (lane 1), two chimpanzees 
in the acute phase of infection (1028 and 508) (lanes 2 and 
3, respectively). PCR was performed on the samples in 
lanes 1-3 for 30 cycles and the autoradiogram of the blot 
containing those lanes was exposed for 5 hours. cDNA from 
15 1 microgram of total RNA from acutely infected animal 1028 
(lane 4), and three uninfected chimpanzees (lanes 5-7), 

were amplified for 60 cycles and the autoradiograms 

32 

containing those lanes were exposed for 7 days. P 
labeled Mspl-digested pBR322 DNA served as markers on all 

20 the autoradiograms. It may be seen from the results that 
cDNA corresponding to HCV RNA was seen only in the samples 
from chimpanzees with NANBH f whether acute or chronic 
(lanes 1, 3, and 4). The cPCR products in these lanes 
migrated between marker fragments of 527 and 622 

25 nucleotides (not shown) . 

Fig. 25B shows the results of the cPCR assay 
using 10% of the RNA extracted from 10mm X 4mm liver 
biopsy cylinders from 15 chronic NANB patients (lanes 1- 
15), one patient with cryptogenic liver disease (lane 16) 

30 and one control sample from a patient with chronic 

Hepatitis B (lane 17). Amplification by PCR was for 30 
cycles and the autoradiogram for the blots were exposed 
for 4 days, except that lane 1 was exposed for 15 hours. 
As seen from the results, 9/15 (60%) of the human samples 

35 were positive for HCV RNA (lanes 1,2,4,6,7,10-13). One 

patient diagnosed with cryptogenic liver disease (lane 16) 
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and one patient with a chronic HBV infection (lane 17) 
were repeatedly negative in the cPCR assay. 



Comparison of the HCV/cPCR Assay on Human Liver Biopsies 
5 and RIA of Serum Using HCV C100-3 Polypeptide 

SOD/HCV C100-3 polypeptide (also called C100) is 
a recombinant fusion polypeptide which contains 363 viral 
amino acids. The polypeptide is useful for detecting 
antibodies to HCV (See Kuo et al. (1989)). The method for 
10 preparing C100 is described in E.P.O. Publication No. 
318,216. 

Radioimmune assay using C100 was performed on 
the sera collected from the same 17 human patients whose 
liver samples were subjected to HCV/cPCR assay as 

15 described supra. The sera was collected on the same day 
as the liver biopsies. The assay was performed es- 
sentially as described in E.P.O. Publication No. 318, 216 , 
which is commonly owned and incorporated herein by refer- 
ence. Briefly, Microtiter plates (Immulon 2, Removeawell 

20 strips) were coated with 0.1 microgram of purified C100. 
The coated plates were incubated for 1 hour at 37°C with 
the serum samples (100 microliters of a 1:100 dilution) or 
appropriate controls. After incubation, the unbound 
material was removed , the plates were washed, and 

25 complexes of human antibody-ClOO were detected by incuba- 
tion with I-labeled sheep anti-human immunoglobulin. 
Unbound labeled antibody was removed by aspiration, and 
the plates were washed. The radioactivity in individual 
wells was determined. 

30 The results of the RIA showed that sixty-seven 

percent of these samples were positive for anti-ClOO anti- 
bodies. Sera from the patient diagnosed with cryptogenic 
liver disease was positive for anti-ClOO antibodies, 
although the levels of viral RNA were undetectable in the 

35 patient's liver in this sample. The level of correlation 
between the presence of anti-ClOO antibodies and HCV RNA 
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was seventy percent; two patients who were negative for 
antibodies by RIA had significant levels of HCV RNA in 
their livers (data not shown). 

The results indicate that virus is frequently 
5 present in the liver of patients with circulating anti- 
C100 antibodies, and confirms claims that the presence of 
anti-ClOO antibodies accurately reflects exposure to HCV. 
Moreover , taken together , these results indicate that HCV 
of this type accounts for NANBH in at least 75% of the 
10 patients in this study, and that the predominant strain of 
HCV in Italy appears to be closely related to the strain 
of HCV prevalent in the United States. 



HCV/cPCR Assay of Sera; Detection of Viral RNA 

15 in Acute Phase Infection in Chimpanzees 

The temporal relationship between the display of 
liver damage, the presence of HCV RNA, and the presence of 
anti-HCV antibodies was monitored in serum from two 
experimentally infected chimpanzees with NANBH (nos. 771 

20 and 910). Liver damage was determined by alanine amino 
transferase (ALT) levels; the presence of HCV RNA was 
determined by the HCV cPCR assay described above; anti-HCV 
antibodies were detected utilizing the C100 RIA. 

The HCV/cPCR analysis was performed on RNA 

25 extracted from 1 microliter of chimpanzee plasma. Serum 
was taken from chimpanzee 771 on days 25, 32, 70 and 88 
post-infection; cPCR was performed for 30 cycles and the 
autoradiogram was exposed for 18 days. Serum was taken 
from chimpanzee 910 on days 11, 28, and 67 post-infection; 

30 cPCR was performed for 60 cycles and the autoradiogram was 
exposed for 5 days . 

The results of the assays are shown in Fig. 26A 
for chimpanzee 771, and Fig. 26B for chimpanzee 910. From 
a comparison of Figs. 26A and 26B, it appears that an 

35 early, well defined peak of ALT values during acute 
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hepatitis correlates with the presence of viral RNA in the 
infected individual. 

The data also indicate that the presence of HCV 
RNA, which is indicative of a state of viremia, precedes 
5 the presence of anti-HCV antibodies. Chimpanzee 771 (Fig. 
26A) exhibited a clearly defined acute episode of post- 
transfusion NANBH at 28 days, as characterized by an 
initial peak of ALT levels. HCV RNA was detected in the 
serum collected at day 25, and at day 32. However, during 

10 this acute phase, anti-HCV antibodies were absent. In 
contrast, at day 70 HCV RNA was below the experimental 
level of detection, and anti-HCV antibodies were rising. 
At day 88, HCV RNA remained undetectable, while anti-HCV 
antibodies were significantly increased over that of day 

15 70. 

The results obtained from the sera of chimpanzee 
910 were somewhat similar in pattern, although the time of 
HCV antibodies induced by the infection were not detected 
during the acute phase of the disease, which extended to 

20 at least day 67; the anti-HCV antibodies detected by RIA 
at day 11 were due to passive immunization of animal 910 
with antibodies from the plasma used to inoculate the 
animal. Anti-HCV antibodies were found in chimpanzee 910 
serum during the later, chronic phase of the infection 

25 (data not shown). 

It should be noted that low ALT values in plasma 
from individuals with chronic NANBH do not necessarily 
correlate with weak virus production. A pool of 17 dif- 
ferent plasma samples taken from chimpanzee 910 over a 

30 period of two to three and one-half years post inoculation 
was monitored for ALT levels and for HCV RNA. The ALT 
values of the samples did not exceed 4 5 mU/ml; neverthe- 
less, titration studies indicated high titers of HCV (3 x 
10^ CID/ml). cPCR was carried out for 30 cycles, and the 

35 autoradiogram was exposed for 15 hours; the cPCR analysis 
clearly showed the presence of viral RNA (data not shown). 
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HCV/cPCR Assay of Sera: Detection of Viral RNA 

in Acute Phase Infection in Humans 

Plasma from a human surgical patient collected 

5 during early acute NANBH was examined for HCV RNA and for 

anti-HCV antibodies, utilizing the HCV/cPCR assay and 

C100-RIA, respectively. The HCV/cPCR assay was conducted 

utilizing 1 microliter of plasma from the patient, and 

from four human controls with known pedigrees; cPCR was 

10 performed for thirty cycles, and after hybridization and 

washing the autoradiogram was exposed for eight hours . 

The results showed that the serum collected from 

the surgical patient during the acute phase of infection 

contained a high level of viral RNA, and that anti-HCV 

15 antibodies were not detectable by the C100-RIA (data now 

shown) . (The acute phase plasma from the surgical patient 

was known to have a high titer of NANBH infectious agent 
6 5 

[10 * CID/ml, as determined by Feins tone et al. (1981); 
Feinstone et al. (1983)]). It should be noted, however, 
20 that this patient did sero-covert to anti-HCV antibodies 
by the C100-RIA approximately 9 months after infection. 
The serum from the pedigreed human control plasmas were 
negative in both the HCV/cPCR assay and C100-RIA. 

25 Sensitivity of HCV/cPCR Assay 

The sensitivity of the HCV/cPCR assay was 
determined by analyzing ten-fold serial dilutions of a 
plasma pool of known titer. The chimpanzee plasma had a 
titer of ~3 x 10 5 CID/ml, and RNA was extracted from ten- 

30 fold dilutions of 1 microliter of the plasma- cPCR was 

performed for 30 cycles, and after hybridization and wash- 
ing, the autoradiogram was exposed for 15 hours. The cPCR 
products resulting from amplification of ~300, ~30, and ~3 
CID of HCV genomes are shown in lanes 1-3, respectively of 

35 Fig. 29. The samples in lanes 1 and 2 were detectable on 
autoradiograms exposed for 2 hours . 
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Since the average titer of HCV in infected 
individuals is believed to be between approximately 100 to 
10,000 CID/ml of plasma, this data suggests that the HCV/ 
cPCR assay may be clinically useful. 

5 

HCV/cPCR Assay for Variant HCV Strains 
Primers, consisting of a set of oligomer 44-mers 
and a set of oligomer 45-mers, were designed to amplify 
strains of HCV which are similar or identical to the HCV 

10 isolate from which the cDNA sequence in Fig. 18 is 

derived. The premise underlying the design of these prim- 
ers is our discovery that HCV is a Flavi-like virus. 
Members of the Flaviviridae family, when compared to HCV, 
have two major conserved sets of amino acid sequences, 

15 TATPPG and QRRGR, in the putative NS3 region of these 
viruses. Several other smaller sets may be seen, for 
example, GDD in the putative NS5 region. Other sets are 
determinable by comparison of the known amino acid 
sequences with that of HCV. This information was deduced 

20 from the sequences for several members of Flaviviridae 

which have been described, including Japanese Encephalitis 
Virus (Sumiyoshi et al . (1987)), Yellow Fever Virus (Rice 
et al. (1985)), Dengue Type 2 Virus (Hahn et al. (1988)), 
Dengue Type 4 Virus (Mackow (1987)), and West Nile Virus 

25 (Castle et al. (1986)). The conserved amino acid 

sequences and codon utilization are in the table im- 
mediately following. 



30 
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Conserved Amino Acid (A. A. ) Sequences 
Among Flaviviruses and HCV 

# of A. A. 



5 



15 



Virus 


first A. A. 




T 


A 


T 


P 


P 


G 


HCV 


1348 


5' 


ACC 


GCC 


ACC 


CCT 


CCG 


GCC 3' 


Yellow Fever 1805 




ACA 


GCC 


ACA 


CCG 


CCT 


GGG 


West Nile 


1818 




ACG 


GCA 


ACG 


CCA 


CCC 


GGG 


Dengue-4 


1788 




ACC 


GCA 


ACC 


CCT 


CCC 


GGA 


JEV 


1957 




ACA 


GCG 


ACC 


CCG 


CCT 


GGA 


HCV sense 


primer ( 4 4 mer ) 




















5' 


ACC 


GCC 


ACC 


CCX 


CC 3' 












(X = A, 


rT,C, 


or G) 






# of 








A. A. 






Virus 


first A. A. 




Q 


R 


R 


G 


R 




HCV 


1486 


5' 


CAA 


CGT 


CGG 


GGC 


AGG 


3' 


Yellow Fever 1946 




CAA 


AGG 


AGG 


GGG 


CGC 




West Nile 


1959 




CAG 


CGG 


AGA 


GGA 


CGC 




Dengue-4 


1929 




CAG 


AGA 


AGA 


GGG 


CGA 




JEV 


1820 




CAA 


CGG 


AGG 


GGC 


AGA 





HCV antisense primer (4 Smew- 
s' GTX GCA GCC CCG TCC 5' 
(X = T or C) 

20 

Note: the primer sequence was chosen to minimize the 
number of nucleotide degeneracies at the 3 '-end of the 
primer sequence and to maximize the number of nucleotides 
at the 3 '-end of each primer which exactly match any of 
the possible nucleotide sequences, or the complement 
thereof , encoding the conserved amino acids indicated 
25 above. 

The 44-mer and 45-mer oligomer primers were 
designed so that the sequences encoding these amino acids 
were incorporated within the primer. Moreover, they 
contain degeneracies at the 3 '-end of each primer, and are 

3 0 

derived from two different regions of the HCV genome which 
are present in clone 40b (See Fig. 28), and which are 
derived from the region encoding putative NS3 of HCV. The 
formulae for the oligonucleotide primers in the sets are: 
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5' GAC TGC GGG GGC GAG ACT GGT TGT GCT CGC 
ACC GCC ACC CCX CC 3' 

5 where X is A,T,G, or C; and 

5' TCT GTA GAT GCC TGG CTT CCC CCT GCC AGT 
CCT GCC CCG ACT YTG 3' 

10 where Y is T or C. 

The HCV/cPCR assay was carried out utilizing 
these primers to amplify HCV RNA in chimpanzee 910 plasma. 
The assay method was essentially as described in Section 
supra., except that the 44-mer and 45-mer sets of oligomer 

15 primers were substituted for the primers derived from 

clone 36 and clone 37b. In addition, detection of ampli- 
fied HCV cDNA was by hybridization with a probe derived 
from clone 40a, the sequence of which is shown in Fig. 32. 

The probe was prepared by amplifying a segment 

20 of clone 40a utilizing the PCR method described supra., 
and 18-mer primers containing the following sequences: 

5' GAG ACA TCT CAT CTT CTG 3' 

25 and 

5 ' GAG CGT GAT TGT CTC AAT 3 ' . 

After amplification, the probe preparation was labeled 

32 

30 with P by nick translation. 

Fig. 33 shows an autoradiograph of the Southern 

blots probed with the sequence derived from Clone 4 0a. 
32 

P labeled Mspl digested pBR322 DNA fragments served as 
markers (lane 1). The predicted size of the PCR product 
35 resulting from amplification using these primers is 490 
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nucleotides (nt). Duplicate reactions are shown in lanes 
2 and 3. 

< 

Analysis for Variants of the 5' -Region of HCV 
5 Based upon the Flavivirus model, the 5' -region 

HCV cDNA which is flanked by the regions represented in 
clones ag30a and k9-l encodes a segment of putative 
envelope and/or matrix protein(s) (E/M) . Serum obtained 
from the chimpanzee from which the HCV cDNA "c" library, 
10 was constructed was analyzed by HCV/cPCR to determine 
whether variants within this target region were present. 

The HCV/cPCR assay was performed essentially as 
described supra., for the isolation of clone 5 '-32, except 
for the primers and probes used. Fig. 37 shows the 
15 relationship of the primers and probes (and the clones 

from which they were derived) to that of the target region 
of HCV cDNA. One set of PCR primers, ag30al6A and 
K91Envl6B, were derived from clones ag30a and k9-l, which 
are upstream and downstream, respectively, of the target 
20 sequence. The expected size of the cPCR product primed by 
ag30al6A and K91Envl6B is 1.145 kb based upon the 
confirmed sequence of HCV cDNA. Two other sets of PCR 
primers covering the region amplified using ag30al6A and 
K91Envl6B, and overlapping each other were also used for 
25 PCR amplification of HCV RNA in the serum. Thus, in this 
case the PCR reactions were run using as one set of prim- 
ers ag30al6A and CA156el6B, and as the second set of prim- 
ers CA156el6A and k91Envl6B. The expected PCR product 
sizes for these pairs were 615 nucleotides (NT) and 683 
30 NT, respectively. The table immediately following lists 
the primer, the clone from which it was derived, and the 
primer sequence. 
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Table 



Primer 


Clone 






Sequence 








ag30al6A 


ag30a 


5' 


CTC 


TAT 


GGC 


AAT 


GAG 


G 


3' 


K91Envl6B 


k9-l 


5' 


CGT 


TGG 


CAT 


AAC 


TGA 


T 


3' 


CA156el6B 


156 


5' 


CGA 


CAA 


GAA 


AGA 


CAG 


A 


3' 


CA156el6A 


156 


5' 


AGC 


TTC 


GAC 


GTC 


ACA 


T 


3' 


CA216al6A 


216 


5' 


TGA 


ACT 


ATG 


CAA 


CAG 


G 


3' 


CA216al6B 


216 


5' 


GGA 


GTG 


TGC 


AGG 


ATG 


G 


3' 


CA84al6A 


84 


5' 


AAG 


GTT 


GCA 


ATT 


GCT 


C 


3' 


CA84al6B 


84 


5' 


ACT 


AAC 


AGG 


ACC 


TTC 


G 


3' 



The probes for all of the HCV/cPCR products consisted of 

15 32 P labeled sections of HCV cDNA which had been prepared 

by PCR amplification of a region of clone 216 (using 

CA216al6A and 216al6B as primers), and of clone 84 (using 

32 

CA84al6A and CA84al6B as primers); P was introduced into 
the PCR products by nick translation. These probes did 

20 not overlap the primers used in the HCV/cPCR reactions. 

Fig. 38 shows an autoradiograph of a Southern 
blot in which the HCV/cPCR products were hybridized with 
the 32 P-labeled probes. The HCV/cPCR product extended 
from primers ag30al6A and K91Envl6B (lane 1) was ap- 

25 proximately 1.1Kb; no other PCR products were observed in 
a 15 hour exposure. The HCV products extended from the 
primer sets ag30a!5A/CA156el6B (lane 2) and CA156el6A/ 
K91Envl6B (lane 3) were approximately 625NT and ap- 
proximately 700 NT, respectively. The size of the PCR 

30 products were determined by comparison with the relative 
migrations of fragments resulting from the digestion of 
pBR322 with Mspl and of PhiX 174 digested with Haelll 
(lane 5) . 

The above study will detect insertions or 
35 deletions as small as approximately 20NT to 50NT and DNA 
rearrangements altering the size of the target DNA. The 
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results in Fig. 38 confirm that there is only 1 major spe- 
cies of cDNA derived from the E/M region of the HCV in the 
chimpanzee serum. 

5 Amplification for Cloning of HCV cDNA Sequences 

Utilizing the PCR and Primers Derived from 
Conserved Regions of Flavivirus Genomic Sequences 

Our discovery that HCV is a flavi-like virus, 
allows a strategy for cloning uncharacterized HCV cDNA 

10 sequences utilizing the PCR technique , and primers derived 
from the regions encoding conserved amino acid sequences 
in f laviviruses . Generally, one of the primers is derived 
from a defined HCV genomic sequence, and the other primer 
which flanks a region of unsequenced HCV polynucleotide is 

15 derived from a conserved region of the flavivirus genome. 
The flavivirus genomes are known to contain conserved 
sequences within the NS1, and E polypeptides, which are 
encoded in the 5 '-region of the flavivirus genome. Thus, 
to isolate cDNA sequences derived from putatively 

20 comparable regions of the HCV genome, upstream primers are 
designed which are derived from the conserved sequences 
within these flavivirus polypeptides . The downstream 
primers are derived from an upstream end of the known por- 
tion of the HCV cDNA. 

25 Because of the degeneracy of the code, it is 

probable that there will be mismatches between the 
flavivirus probes and the corresponding HCV genomic 
sequence. Therefore a strategy which is similar to the 
one described by Lee (1988) is used. The Lee procedure 

30 utilizes mixed oligonucleotide primers complementary to 
the reverse translation products of an amino acid 
sequence; the sequences in the mixed primers takes into 
account every codon degeneracy for the conserved amino 
acid sequence. 

35 Three sets of primer mixes are generated, based 

on the amino acid homologies found in several 
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f laviviruses, including Dengue-2,4 (D-2,4), Japanese 
Encephalitis Virus (JEV), Yellow Fever (YF), and West Nile 
Virus (WN) . The primer mixture derived from the most 
upstream conserved sequence (5 r -l), is based upon the 
5 amino acid sequence gly-trp-gly, which is part of the 

conserved sequence asp-arg-gly-trp-gly-aspN found in the E 
protein of D-2, JEV, YF, and WN- The next primer mixture 
(5 '_2) is based upon a downstream conserved sequence in E 
protein, phe-asp-gly-asp-ser-tyr-ileu-phe-gly-asp-ser-tyr- 

10 ileu, and is derived from phe-gly-asp; the conserved 
sequence is present in D-2, JEV, YF, and WN. The third 
primer mixture (5 '-3), is based on the amino acid sequence 
arg-ser-cys, which is part of the conserved sequence cys- 
cys-arg-ser-cys in the NS1 protein of D-2, D-4 , JEV, YF, 

15 and WN. The individual primers which form the. mixture in 
5 '-3 are shown in Fig. 53. In addition to the varied 
sequences derived from conserved region, each primer in 
each mixture also contains a constant region at the 5 '-end 
which contains a sequence encoding sites for restriction 

20 enzymes, Hindlll, Mbol, and EcoRI . 

The downstream primer, ssc5h20A, is derived from 
a nucleotide sequence in clone 5h, which contains HCV cDNA 
with sequences with overlap those in clones 14i and lib. 
The sequence of ssc5h20A is 

25 

5 ' GTA ATA TGG TGA CAG AGT CA 3 ' . 

An alternative primer, ssc5h34A, may also be used. This 
primer is derived from a sequence in clone 5h f and in ad- 
30 dition contains nucleotides at the 5 ' -end which create a 
restriction enzyme site, thus facilitating cloning. The 
sequence of ssc5h34A is 

5 ' GAT CTC TAG AGA AAT CAA TAT GGT GAG AGA GTC A3'. 

35 



WO 90/14436 



-112- 



PCT/US90/02853 



The PCR reaction, which was initially described 
by Saiki et al. (1986), is carried out essentially as 
described in Lee et al. (1988), except that the template 
for the cDNA is RNA isolated from HCV infected chimpanzee 
5 liver, or from viral particles isolated from HCV infected 
chimpanzee serum. In addition, the annealing conditions 
are less stringent in the first round of amplification 
(0.6M NaCl, and 25°C) , since the part of the primer which 
will anneal to the HCV sequence is only 9 nucleotides, and 

10 there could be mismatches. Moreover, if ssc5h34A is used, 
the additional sequences not derived from the HCV genome 
tend to destabilize the primer-template hybrid. After the 
first round of amplification, the annealing conditions can 
be more stringent (0.066M NaCl, and 32°C-37°C), since the 

15 amplified sequences now contain regions which are com- 
plementary to, or duplicates of the' primers. In addition, 
the first 10 cycles of amplification are run with Klenow 
enzyme I, under appropriate PCR conditions for that 
enzyme. After the completion of these cycles, the samples 

20 are extracted, and run with Taq polymerase, according to 
kit directions, as furnished by Cetus/Perkin-Elmer . 

After the amplification, the amplified HCV cDNA 
sequences are detected by hybridization using a probe 
derived from clone 5h. This probe is derived from 

25 sequences upstream of those used to derive the primer, and 
does not overlap the sequences of the clone 5h derived 
primers. The sequence of the probe is 

5' CCC AGC GGC GTA CGC GCT GGA CAC GGA GGT GGC CGC GTC 
30 GTG TGG CGG TGT TGT TCT CGT CGG GTT GAT GGC GC 3 ' . 
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Industrial Applicability 
The methods described herein, as well as the 
oligomers, both probes and primers, derived from HCV cDNA, 
5 and kits containing them, are useful for the accurate, 

relatively simple, and economic determination of the pres- 
ence of HCV in biological samples, more particularly in 
blood which may be used for transfusions, and in 
individuals suspected of having HCV an infection. More- 

10 over, these methods and oligomers may be useful for 

detecting an earlier stage of HCV infection than are im- 
munological assays based upon the use of a recombinant HCV 
polypeptides . Also, an amplified polynucleotide 
hybridization assay detects HCV RNA in occasional samples 

15 which are anti-HCV antibody negative. Thus, the probes 
and primers described herein may be used amplified 
hybridization assays, in conjunction with an immunoassays 
based on HCV polypeptides to more completely identify 
infections due to HCV, and HCV-infected biological 

20 specimens, including blood. 

The information provided herein allows the 
design of primers and/or probes which are derived from 
conserved regions of the HCV genome. The provision of 
these primers and probes makes available a general method 

25 which will detect variant HCV strains, and which will be 
of use in the screening of blood and blood products . 

If the primers used in the method are derived 
from conserved regions of the HCV genome, the method 
should aid in the detection and/or identification of 

30 variant strains of HCV. This, in turn, should lead to the 
development of additional immunological reagents for the 
detection and diagnosis of HCV, as well as the development 
of additional polynucleotide reagents for detection and or 
treatment of HCV. 

35 In addition, sets of primers and probes designed 

from the conserved amino acid sequences of Flaviviruses 
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and HCV allow for a universal detection method for these 
infectious agents. 

The following listed materials are on deposit 
under the terms of the Budapest Treaty with the American 
Type Culture Collection (ATCC), 12301 Parklawn Dr., 
Rockville, Maryland 20852, and have been assigned the fol- 
lowing Accession Numbers. 





lambda-gtll 




ATCC No. 


Deposit 


Date 


10 


HCV cDNA library 


40394 


1 Dec. 


1987 




clone 81 




40388 


17 Nov. 


1987 




clone 91 




40389 


17 Nov. 


1987 




clone 1-2 




40390 


17 Nov. 


1987 




clone 5-1-1 




40391 


18 Nov. 


1987 


15 


clone 12 f 




40514 


10 Nov. 


1988 




clone 35 f 




40511 


10 Nov. 


1988 




clone 15e 




40513 


10 Nov. 


1988 




clone K9-1 




40512 


10 Nov. 


1988 




JSC 308 




20879 


5 May 


1988 


20 


pS356 




67683 


29 April 


1988 




In addition, 


the following deposits were 


made on 11 May 




1989. 










25 


Strain 




Linkers 


ATCC No. 






D1210 


(Cfl/5-1-1) 


EF 


67967 






D1210 


(Cfl/81) 


EF 


67968 






D1210 


(Cf l/CA74a) 


EF 


67969 






D1210 


(Cfl/35f ) 


AB 


67970 




30 


D1210 


(Cfl/279a) 


EF 


67971 






D1210 


(Cfl/C36) 


CD 


67972 






D1210 


(Cfl/13i) 


AB 


67973 






D1210 


(Cfl/C33b) 


EF 


67974 






D1210 


(Cf l/CA290a) 


AB 


67975 




35 


HB101 


(AB24/C100 #3R) 




67976 
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The following derivatives of strain D1210 were deposited 
on 3 May 1989. 



Strain Derivative 


ATCC NO . 


pCFlCS/C8f 




pCFlAB/C12f 


blyoZ 


pCFlEF/14c 


67949 


pCFlEF/15e 


67954 


pCFlAB/C25c 


67958 


pCFlEF/C33c 


67953 


pCFlEF/C33f 


67050 


pCFlCD/33g 


67951 


pCFlCD/C39c 


67955 


pCFlEF/C40b 


67957 


pCFlEF/CA167b 


67959 



The following strains were deposited on May 12, 1989. 



Strain ATCC No. 

20 Lambda gtll(C35) 40603 

Lambda gtl0(beta-5a) 40602 

D1210 (C40b) 67980 

D1210 (M16) 67981 



25 The following biological materials were deposited on March 
23, 1990. 

Material ATCC No. 

5'-clone32 (in pUC18S) 68276 

30 
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10 



CLAIMS 

1. An oligomer capable of hybridizing to an HCV 
sequence in an analyte polynucleotide strand, wherein the 
oligomer is comprised of an HCV targeting sequence com- 
plementary to at least 4 contiguous nucleotides of HCV 
cDNA shown in Fig. 18. 

2. The oligomer of claim 1, wherein the target- 
ing sequence is comprised of nucleotides which are com- 
plementary to nucleotides selected from the following HCV • 
cDNA nucleotides shown in Fig. 18 , (nn x - nn denotes 
nucleotide number x to nucleotide number y) ) : 



15 



20 



25 



nn -340 




nn -330 ; 


nn -330 




nn_ 


320 ; 


nn_ 


320 


- nn_ 


310 


nn -310 




nn -300 ; 


nn -300 




nn_ 


290 ? 


nn_ 


290 


- nn_ 


280 


nn -280 




nn_ 27Q ; 


nn -270 




nn_ 


260 ? 


nn_ 


260 


- nn_ 


250 


nn -250 




nn -240 ; 


nn -240 




nn_ 


230 ; 


nn_ 


230 


- nn_ 


220 


nn -220 




nn_ 210 ; 


nn -210 




nn_ 


200 ; 


nn 


200 


- nn_ 


190 


nn -190 




nn -180 ; 


nn -180 




nn_ 


170 ; 


nn_ 


170 


- nn 


160 


nn -160 




nn -150'" 


nn -150 




nn_ 


140 ; 


nn_ 


140 


- nn_ 


130 


nn -130 




""-120' 


nn -120 




nn_ 


110'" 


nn_ 


110 


- nn_ 


100 


nn -ioo 




nn -90 ; 


nn -90 " 


nn -80 


; nn_ 


•80 


- nn 


-70 ; 





nn -70 " nn -60 ; 



l -60 



nn_ 4Q - nn_ 30 ; nn_ 3() 



- nn 



nn_ 10 - nn 1 , 



nn l " nn i0* 



-50' 
-20 ; 
nn 10 



'-50 



-40' 



nn_ 20 - nn_ 1Q ; 



nn 
nn 



30 nn 
nn 
nn 
nn 
nn 

35 nn 



30 
70 
110 
140 
170 
200 
230 
260 



- nn 4Q ; nn 4Q 



- nn 50 ; nn 50 



- nn 2Q ; 
- nn 



nn, 



20 



- nn 



60' nn 60 



- nn 



80' 
nn 



nn 



- nn 



- nn 

- nn 

- nn 



- nn 



120' 
150 ; 
180'* 
210 ; 
240 ; 
270 ; 



80 
nn 

nn 

nn 

nn 

nn 

nn 



- nn 

120 



90' 
nn 



nn 



90 



- nn 



150 
180 
210 
240 
270 



- nn 



- nn 



- nn 



- nn 



- nn 



130' 
160'" 
190* 
220 ; 
250'" 
280'" 



nn 



nn 



nn 



nn 



nn 
nn 



130 
160 
190 
220 
250 
280 



100' 
- nn 



nn 



100 



30' 
nn 7Q ; 

- nn 



110' 



- nn 



- nn 



- nn 

- nn 



- nn 



140 
170 
200 
230 
260 
290 



nn 2 go - nn 30 o; nn 300 - nn 310 ; nn 310 - nn 32 o 
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15 



20 



25 



30 



35 



nn 320 




nn 330 ; 


nn 330 




nn 340 ? 


nn 340 




nn • 

nn 35Q , 


""350 




nn 360 ; 


nn 360 




nn 37Q ; 


nn 370 




nn 380 ; 


nn 380 




n n * 

""390' 


nn 

nn 390 




n n • 

nn 400' 


n n 

nn 400 




nn • 
nn 410' 


nn 410 




nn 420 ; 


nn 420 




nn 430 ; 


n n 

nn 430 




nn • 

nn 440' 


nn 440 




nn • 

nn 450' 


nn 

nn 450 




nn 460' 


nn 




nn 470' 


nn 4?0 




nn 480 ; 


nn 480 




nn 4gQ ; 


nn 

nn 490 




nn • 

nn 500' 


nn 500 




nn 510 f 


nn 510 




nn • 

nn 520' 


n n 

nn 520 




nn • 

nn 530' 


nn 530 




nn 54Q ; 


nn 540 




nn 550 ; 


nn 550 




nn 560 ; 


nn 560 




nn 57Q ; 


nn 5?0 




nn 58Q ; 


nn 580 




nn 59Q ; 


nn 5gQ 




nn 600 ; 


nn 600 




nn 610 ; 


nn 610 




nn 620 ? 


nn 620 




nn 630 ; 


nn 630 




nn 640 ; 


nn 640 




nn 650 ? 


nn 650 




nn 660 ; 


nn 660 




nn g70 ; 


nn 670 




nn 680 ; 


nn 680 




nn 690 ; 


nn 690 




nn 700 ; 


nn 700 




nn 710 ; 


nn 710 




nn 720' 


nn 720 




nn 730 ; 


nn 730 




nn ?40 ; 


nn 740 




nn 75Q ; 


nn ?50 




nn 76Q ; 


nn 760 




nn 770 , 


nn 77Q 




nn 7g0 ; 


nn 780 




nn 790 ; 


nn ?go 




nn 800 ; 


nn 800 




nn 8ig ; 


nn 810 




nn 820' 


n rt 

nn 820 




nn • 

nn 830' 


nn 830 




nn 84Q ; 


nn 840 




nn 850 ; 


nn 850 




nn 860 ; 


nn 860 




nn 870 ; 


nn 870 




nn 880' 


nn 880 




nn • 

nn 890' 


nn 890 




nn 900' 


nn goo 




nn gi0 ; 


nn 91Q 




nn 920 ; 


nn 920 




nn 930 ; 


nn 930 




nn 94Q ; 


nn 940 




nn 950 ; 


nn 950 




nn g60 ; 


nn 960 




nn 970' 


nn 9?0 




nn g80 ; 


nn 980 




nn 990' 


nn 990 




nn iooo ; 


nn 1000 


- nn 1Q 



nn 1010 




nn 1020 ; 


nn 1020 




nn 1030 ; 


nn 1030 




nn 1040 


nn 1040 




nn 1050 ; 


nn 1050 




nn 1060 ; 


nn 1060 




nn 1070 


nn 1070 




nn 1080 ; 


nn 1080 




nn 1090 ; 


nn 1090 




nn 1100 


nn U00 




nn ul0 ; 


nn 1110 




nn 1120 ? 


nn 1120 




nn 1130 


nn U30 




nn 114Q ; 


nn 1140 




nn 1150 7 


nn 1150 




nn 1160 


nn 1160 




nn 117Q ; 


nn 117Q 




nn 1180 ; 


nn 1180 




nn 1190 


nn 119Q 




nn 1200 ? 


nn 1200 




nn 1210 ; 


nn 1210 




nn 1220 


nn 1220 




nn 1230 ; 


nn 1230 




nn 1240 ; 


nn 1240 




nn 1250' 


nn 1250 




nn 1260 ; 


nn 1260 




nn 1270 ? 


nn 1270 




nn 1280 


nn 1280 




nn 1290 ? 


nn 1290 




nn 1300 ? 


nn l300 




nn 1310' 


nn 1310 




nn 1320 ; 


nn 1320 




nn 1330 ; 


nn 1330 




nn 1340' 


nn 1340 




nn 1350 ; 


nn 1350 




nn 1360 ; 


nn 1360 




nn 1370' 


nn 1370 




nn 1380 ; 


nn 1380 




nn 1390? 


nn 1390 




nn 1400< 
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nn 1400 


- 


nn 141Q ; 




- 


nn 144Q ; 


""1460 


- 


nn 1470 ; 


nn 149Q 


- 


nn 1500 ; 


nn 1520 


— 


nn 1530 ; 


nn 1550 


— 


nn 1560 ; 


nn 1580 


— 


nn 1590 ; 


nn 1610 


— 
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3. The oligomer of claim 1, wherein the target- 
ing sequence is comprised of a sequence which is com- 
plementary to a sequence of at least 8 nucleotides present 
in a conserved HCV nucleotide sequence in HCV RNA. 

10 

4. The oligomer of claim 3, wherein the 
conserved sequence is located in the sequence of 
nucleotide numbers from the 5 '-terminus to about 200 in 
Fig. 18. 

15 

5. The oligomer of claim 3, wherein the 
conserved sequence is located in the sequence of 
nucleotide numbers from about 4000 to about 5000 in Fig. 
18. 

20 

6. The oligomer of claim 3, wherein the 
conserved sequence is located in the sequence of 
nucleotide numbers from about 8000 to about 9040 as shown 
in Fig. 18. 

25 

7. The oligomer of claim 3, wherein the 
conserved sequence is located in the sequence of 
nucleotide numbers from about -318 to about 174 as shown 
in Fig. 18. 

30 

8. The oligomer of claim 3, wherein the 
conserved sequence is located in the sequence of 
nucleotide numbers from about or from about 4056 to about 
4448 as shown in Fig. 18. 
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9. The oligomer of claim 3, wherein the 
conserved sequence is located in the sequence of 
nucleotide numbers from about 4378 to about 4902 as shown 
in Fig. 18. 

5 

10. The oligomer of claim 3, wherein the 
conserved sequence is located in the sequence of 
nucleotide numbers from about 4042 to about 4059 as shown 
in Fig. 18. 

10 

11. The oligomer of claim 3, wherein the 
conserved sequence is located in the sequence of 
nucleotide numbers from about 4456 to about 4470, as shown 
in Fig. 18 . 

15 

12. The oligomer of claim 3 f wherein the 
conserved sequence is located in the sequence of 
nucleotide numbers from about 8209 to about 8217, as shown 
in Fig. 18. 

20 

13. The oligomer of claim 3 f which is a 
capture probe. 

14. The oligomer of claim 3 f which is a label 

25 probe. 

15. The oligomer of claim 3, which is a 

primer . 

30 16. A process for detecting an HCV sequence in 

an analyte strand suspected of containing an HCV 
polynucleotide, wherein the HCV polynucleotide comprises a 
selected target region, said process comprising: 

(a) providing an oligomer capable of hybridizing 

35 to an HCV sequence in an analyte polynucleotide strand, 
wherein the oligomer is comprised of an HCV targeting 
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sequence complementary to at least 4 contiguous 
nucleotides of HCV cDNA shown in Fig. 18 

(b) incubating the analyte strand with the 
oligomer of (a) which allow specific hybrid duplexes to 
5 form between the targeting sequence and the target 
sequence; and 

(d) detecting hybrids formed between target 
region, if any, and the oligomer. 

10 

17. The process of claim 16 which further 

comprises : 

(a) providing a set oligomers which are primers 
for the polymerase chain reaction method and which flank 

15 the target region; and 

(b) amplifying the target region via a 
polymerase chain reaction method. 

18. A kit for detecting an HCV target sequence 
20 in an analyte strand, comprising the oligomer of claim 1 

packaged in a suitable container. 

19. A method for preparing blood free of HCV 
comprising: 

25 (a) providing analyte nucleic acids from a 

sample of blood suspected of containing an HCV target 
sequence; 

(b) providing an oligomer capable of hybrid- 
izing to the HCV sequence in an analyte polynucleotide 

30 strand, if any, wherein the oligomer is comprised of an 
HCV targeting sequence complementary to a sequence of at 
least 8 nucleotides present in a conserved HCV nucleotide 
sequence in HCV RNA; 

(c) reacting (a) with (b) under conditions 
35 which allow the formation of a polynucleotide duplex 
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between the targeting sequence and the target sequence, if 
any; 

(d) detecting a duplex formed in (c), if any; 

and 

5 (e) saving the blood from which complexes were 

not detected in (d). 
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:42.16.XT1 

GGTAGGGTCAAGGCTGAAATCGACTGTCTGCTTCTTTGGAGAAAGTGGTG 



:42.17.XT1 

ATCCTGGGGGAGCGTGATTGTCTCAATGGTCTTCTTTGGAGAAAGTGGTG 



:42.18.XT1 

AGTCCTGCCCCGACGTTGAGTGCGGGAGACCTTCTTTGGAGAAAGTGGTG 
•42.19.XT1 

CACAAATCTGTAGATGCCTGGCTTCCCCCTCTTCTTTGGAGAAAGTGGTG 
:42.20.XT1 

GTCGAACATGCCGGAGGGGCGCTCCCCCGGCTTCTTTGGAGAAAGTGGTG 
:42.21.LLA2C 

GCCTGCGTCATAGCACTCACAGAGGACGGATTAGGCATAGGACCCGTGTC 
:42.22.LLA2C 

AGTCTCGGCGGGCGTGAGCTCATACCAAGCTTAGGCATAGGACCCGTGTC 
:42.23.LLA2C 

CGGGGTGTTCATGTACGCTCGTAGCCTAACTTAGGCATAGGACCCGTGTC 
:42.24.LLA2C 

AAATTCAAGATGGTCCTGGCACACGGGAAGTTAGGCATAGGACCCGTGTC 
:42.25.LLA2C 

TATATGAGTGAGGCCTGTAAAGACGCCCTCTTAGGCATAGGACCCGTGTC 
:42.26.LLA2C 

ACTCTGCTTTGTCTGGGATAGAAAGTGGGCTTAGGCATAGGACCCGTGTC 
:42.27.LLA2C 

TTGGTACGCTACCAGGTAAGGAAGGTTCTCTTAGGCATAGGACCCGTGTC 
:42.28.LLA2C 

GGGAGGGGCTTGAGCCCTAGCGCACACGGTTTAGGCATAGGACCCGTGTC 
:42.29.LLA2C 

AATCAAACACTTCCACATCTGGTCCCACGATTAGGCATAGGACCCGTGTC 
:42.30.I£A2C 

GGGTGTTGGCCCATGGAGGGTGGGCTTGAGTTAGGCATAGGACCCGTGTC 
:42.31.LLA2C 

TTCATTCTGAACAGCGCCCAGTCTGTATAGTTAGGCATAGGACCCGTGTC 



FIG. 19-1 
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•42 XTl.l 

TCCTCACAGGGGAGTGATTCATGGTGGAGTCTTCTTTGGAGAAAGTGGTG 

• 4 2 XTl 2 

ATGGCTAGACGCTTTCTGCGTGAAGACAGTCTTCTTTGGAGAAAGTGGTG 

• 4 2 XTl 3 

TCCTGGAGGCTGCACGACACTCATACTAACCTTCTTTGGAGAAAGTGGTG 

• 4 2 XTl 4 

CGCAGACCACTATGGCTCTCCCGGGAGGGGCTTCTTTGGAGAAAGTGGTG 
•42 XTl. 5 

TCGTCCTGGCAATTCCGGTGTACTCACCGGCTTCTTTGGAGAAAGTGGTG 
:42.LLA2C.6 

GCATTGAGCGGGTTGATCCAAGAAAGGACCTTAGGCATAGGACCCGTGTC 
:42.LLA2C.7 

AGCAGTCTTGCGGGGGCACGCCCAAATCTCTTAGGCATAGGACCCGTGTC 
:42.LLA2C.8 

ACAAGGCCTTTCGCGACCCAACACTACTCGTTAGGCATAGGACCCGTGTC 
:42.LLA2C.9 

GGGGCACTCGCAAGCACCCTATCAGGCAGTTTAGGCATAGGACCCGTGTC 
:42.LLA2.10 

CGTGCTCATGGTGCACGGTCTACGAGACCTTTAGGCATAGGACCCGTGTC 
:42.LLA2C.ll 

GTTACGTTTGTTTTTTTTTTGAGGTTTAGGTTAGGCATAGGACCCGTGTC 
:42.LLA2C.12 

CGGGAACTTGACGTCCTGTGGGCGACGGTTTTAGGCATAGGACCCGTGTC 
:42.LLA2C.13 

CAAGTAAACTCCACCAACGATCTGACCGCCTTAGGCATAGGACCCGTGTC 
:42.LLA2C14 

GCGCACACCCAATCTAGGGCCCCTGCGCGGTTAGGCATAGGACCCGTGTC 
:42.LLA2C.15 

AGGTTGCGACCGCTCGGAAGTCTTTCTCGTTTAGGCATAGGACCCGTGTC 

FIG. 19-2 
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ATGTTGGGATGGGGCACAGTGACGGAGCCCCTTCTTTGGAGAAAGTGGTG 



ATCTCTCCGGTGGTGGACAGAGCAACCTCCCTTCTTTGGAGAAAGTGGTG 



:42. J4.AXJ. 

ACTTCGAGGGGGATAGCCTTGCCGTAAAAACTTCTTTGGAGAAAGTGGTG 
:42.35.XT1 

TGACAGAAGATGAGATGTCTCCCCCCCTTGCTTCTTTGGAGAAAGTGGTG 
:42.36.LLA2C 

TTTGCGGCGAGTTCGTCGCACTTCTTCTTTTTAGGCATAGGACCCGTGTC 
:42.37.LLA2C 

TAGGCCACGGCATTGATGCCCAATGCGACCTTAGGCATAGGACCCGTGTC 
:42.38.LLA2C 

GTCGGGATGACGGACACGTCAAGACCGCGGTTAGGCATAGGACCCGTGTC 
:42.39.LLA2C 

GCATCGGTTGCCACGACGACAACATCGCCGTTAGGCATAGGACCCGTGTC 
:42.40.LLA2C 

GAGTCGAAGTCGCCGGTATAGCCGGTCATGTTAGGCATAGGACCCGTGTC 
:42.4l.LLA2C 

GTCTGGGTGACACACGTATTGCAGTCTATCTTAGGCATAGGACCCGTGTC 
:42.42.IiA2C 

ATGGTGAAGGTAGGGTCAAGGCTGAAATCGTTAGGCATAGGACCCGTGTC 
.-42.43.LLA2C 

GAGACAGCATCCTGGGGGAGCGTGATTGTCTTAGGCATAGGACCCGTGTC 

FIG. 19-3 
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10 20 30 40 

EC10 GAATTCGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATAT 

HCV1 CTCTCCCAGGCGCCACTGGACGACGCAAGGTTCCAATTGCTCT 

550 560 570 580 590 600 

50 60 70 80 A 90 100 

AACAGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTAGT 

AACGGGTCACCG<^TGGCATCGGATATGATGATC^ 

610 620 630 640 650 660 

110 120 130 140 150 160 

GGTAGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTG 

AATGGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTG 
670 680 690 700 710 720 

170 180 190 200 210 220 

GGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCTTGGC 

• i ••»•••«•••«•) i » • « • • • » S ! • S • i ! S ! S ! S ! ! ! ! ! 2 2 S 2 

GGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGT 
730 740 750 760 770 780 

230 240 250 260 270 280 

AGTGCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACTGGGGGGATCGCCGC 

AGTGCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGG 
790 800 810 820 830 840 

290 300 310 320 330 340 

CAAAACTACGGCTAGCCTTACTGGTCTCTTCAATTTAGGTGCCAAGCAGAACATCCAGCT 

5 5 2 2 2 t X Z * II I 2 * * * 552 ■••••••••«•• ••••••• 

CCAC&CTGTGTCTGGATTTGTTAGCCTCCTCGC^ 

850 860 870 880 890 900 

350 360 370 380 390 400 

GATCAACACCAACGGCAGTTGGCACATCAACAGGACGGCCTTGAACTGCAATGATAGCCT 
5 :::::: 5 ::::::::::::::: 2 5 : : : 5 55 :::::: : : : : : : 5 2 : 2 : 2 2 : 2 : : : : 
GATCAACACCAACGGCAGTTGGCACCTCAATAGCACGGCCCTGAACTGCAATGATAGCCT 
910 920 930 940 950 960 

410 420 

CAACACCGGCTGGAATTC 
::::::::::: :X 

CAACACCGGCTGGTTGGCAGGGCTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCC 
970 980 990 1000 1010 1020 

FIG. 50-1 
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FIG. 51 



(p) 



84/86 

aa *1 17-308 Mutative envelope region) 

1) HCT #18 (USA) 3 clones sequenced 

2) JH23 (USA) ? 

3) JH 27 (USA) ? 

4) PBL-Th (USA) 2 clones sequenced 

5) EC1 (Italy) 3 clones sequenced 

6) HCV-1 (chimpanzee) multiple 

C/M«-j-»S 

1) 
2) 
3) 
4) 
5) 

QRNLGKVID7LTCGFADUVIGYIPLVGAPLGGAARALAHGVRVLEDGVNYATGNL 

1) H 

2) 

3) S T T 

4) L 

5) (F) S 

6) PGCSFSIFLLALLSCLTVPASAYQVRNSTGLYHVTNDCPNSSIVYEAADAILH 

1) (H) V V T 

2) A D V V K T 

3) S PVA N 

4) A ART 

5) H V T 

6) TPGCVPCVREGNASRCWVAMTPTVATRDGKLPATQLRRHIDLLVGSATLCS 
D 

2) I D 

3) D 
4) 

5) I 

6) ALWGE)LCGSVFLVGQLFTFSPRRHWTTCX3CNCSI 
SUMMARY: "S" AA1 17-308 (93%) 

HCT#18, PBL-Th, EC1 (Italy) have 97% homology with HCV-1 

JH23 and JH 27 have 96% and 95% homology with HCV-1, respectively 
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AA#300-438 ( C-termina l region of the putative envelope region and 
amino -1/3 of NSH 

1) JH23 

2) JH27 

3) Japanese isolate (T. Miyamura) 

4) EC10 (Italy) 



? 
? 
? 

2 clones sequenced 
(one nt difference, which did not 
result in an amino acid change) 
multiple 

s<— j— >NSI 

a v 

A 



5) HCV-1 (chimpanzee) 

D D 

2) D 

3) VS VM V 

4) 

5)TTQGCNCSIYPGHITGHRMAWDMMMNWSPTTALV^CaJ.RIPQAILDMIAGA 

1) M R ARSTA VA 

2) T YT N AR TQALT F 

3) L Y I M GH R VQ VT TLT 

4) A I A K TASLTA 

5) HWGVUGIAYFSMVGNWAKVLWL1LFAGVDAETHVTGGSAGHTVSGFVSL 

1 )FS R I I TV 

2) FT Dl I R AD 

3) FR S Kl V I R OF 

4) FNL I I R N 

5) LAPGAKQNVQLINTNGSWHLNSTALNCNDSLNTGWL 



SUMMARY: NS 1 AA 330-660 

"Isolate" ZHomology (AA330-A38) 

JH23 83 

JH27 80 

Japanese 73 

EC10 (Italy) 84 



%Homology (AA383-405) 

57 
39 
A8 
48 



FIG. 52 
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FIG. 53 
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